Upcoming Changes to NCBI Taxonomy Classifications

Upcoming Changes to NCBI Taxonomy Classifications

NCBI is continually making improvements to our Taxonomy resource in response to new data and changes in biological nomenclature and classification. In the coming months, we will update the higher-level classification of birds (Aves), budding yeasts (Saccharomycotina), prokaryotes (Bacteria and Archaea) and Viruses. This update will also change the formal ranks of several high-level taxonomic names including Eukaryota. Except for the new species names for Viruses, none of these changes will affect organism names at the species level or below.  

Here is a brief overview of changes to each group in the order we plan to make them. Stay tuned for upcoming posts, which will describe the changes for each category in more detail.  Continue reading “Upcoming Changes to NCBI Taxonomy Classifications”

Explore Population Genetics in dbSNP with NCBI’s Allele Frequency Aggregator (ALFA)

Explore Population Genetics in dbSNP with NCBI’s Allele Frequency Aggregator (ALFA)

Access to comprehensive and accurate allele frequency data is essential to understanding the impact of genetic variations on human health and disease. Allele Frequency Aggregator (ALFA) provides the Database of Single Nucleotide Polymorphisms (dbSNP) with allele frequency data for 200K subjects from the Database of Genotypes and Phenotypes (dbGaP). ALFA adheres to the Findable, Accessible, Interoperable, and Reusable (FAIR) Data Principles providing open access to valuable allele frequency data.   Continue reading “Explore Population Genetics in dbSNP with NCBI’s Allele Frequency Aggregator (ALFA)”

NCBI & NIAID Join Forces to Co-Host Virtual AMR Codeathon

NCBI & NIAID Join Forces to Co-Host Virtual AMR Codeathon

Resistance is Futile: A codeathon to combat antimicrobial resistance 

Do you perform antimicrobial resistance (AMR) research, or would you like to join a team to work on microbial and AMR data analysis? NCBI is joining forces with the National Institute of Allergy and Infectious Diseases (NIAID) to co-host a virtual codeathon, Resistance is Futile: A codeathon to combat antimicrobial resistance, which will take place September 23-27, 2024.   Continue reading “NCBI & NIAID Join Forces to Co-Host Virtual AMR Codeathon”

New Data Available! Access Avian Influenza A (H5N1) Virus Sequences at NCBI

New Data Available! Access Avian Influenza A (H5N1) Virus Sequences at NCBI

Sequence data from the ongoing avian influenza A (H5N1) virus outbreak in cattle are now available through NLM’s NCBI resources NCBI Virus and NCBI Datasets.

These data were submitted by the U.S. Department of Agriculture (USDA), U.S. Centers for Disease Control and Prevention (CDC), the World Health Organization (WHO), Iowa State University, and St. Jude Children’s Research HospitalContinue reading “New Data Available! Access Avian Influenza A (H5N1) Virus Sequences at NCBI”

Ortholog Groups Added for ~2 Million Insect Genes

Ortholog Groups Added for ~2 Million Insect Genes

Find evolutionarily related genes across insects and other arthropods on our new Ortholog webpages

NCBI recently released a set of orthologs for approximately 2 million insect genes. You can now find and access the orthologous genes, transcripts, and proteins by searching a species and gene name in NCBI All Databases, NCBI Gene, or NCBI Datasets. As previously described, these orthologs are based on comparisons to the Drosophila melanogaster annotated genome. Using Drosophila gene nomenclature for orthologs should lead to more informative gene symbols for insects and other arthropods.  Continue reading “Ortholog Groups Added for ~2 Million Insect Genes”

International Nucleotide Database Collaboration (INSDC) Introduces Enhanced Website

International Nucleotide Database Collaboration (INSDC) Introduces Enhanced Website

Aims to broaden INSDC membership and attract diverse new members

The National Center for Biotechnology Information (NCBI) at the National Library of Medicine (NLM) and other Founding Members of the International Nucleotide Database Collaboration (INSDC) have enhanced its website, www.insdc.org, to provide comprehensive information on how interested parties from around the world can evaluate their readiness to participate in the INSDC. This effort supports INSDC’s aim to broaden membership and attract qualified nucleotide sequence databases. Web content now includes a formalized Founders Arrangement and a Membership Arrangement, along with other updated information about the INSDC mission, vision, governance, and technical documentation. In doing so, INSDC encourages interested parties to visit the INSDC website to learn more. Continue reading “International Nucleotide Database Collaboration (INSDC) Introduces Enhanced Website”

New! RefSeq Release 224

New! RefSeq Release 224

Check out RefSeq release 224, now available online and from the FTP site. You can access RefSeq data through NCBI Datasets.

What’s included in this release?

As of May 6, 2024, this full release incorporates genomic, transcript, and protein data containing:

  • 435,879,646 records
  • 324,246,652 proteins
  • 62,348,147 RNAs
  • Sequences from 150,742 organisms

The release is provided in several directories as a complete dataset and also as divided by logical groupings. Continue reading “New! RefSeq Release 224”

Automated Lineage Definitions Now Available in NCBI Virus SARS-CoV-2 Variants Overview

Automated Lineage Definitions Now Available in NCBI Virus SARS-CoV-2 Variants Overview

Recently, NCBI Virus SARS-CoV-2 Variants Overview moved from a manual to an automated process for selecting mutations required to define a lineage (e.g., Omicron, BA.2, JN.1, etc.). With this update, the SARS-CoV-2 Variant Overview provides coverage for all SARS-CoV-2 lineages and is no longer limited to only lineages with CDC status. The SARS-CoV-2 Variants Overview website reports results from analyzing both GenBank and unassembled Sequence Read Archive (SRA) sequence data. It allows you to view geographic and frequency trends of records assigned to Pango lineages and search for sequence records using lineage-defining or other mutations (example shown in Figure 1)  Continue reading “Automated Lineage Definitions Now Available in NCBI Virus SARS-CoV-2 Variants Overview”

NCBI Pathogen Detection Presents the Antibiotic Susceptibility Test (AST) Browser

NCBI Pathogen Detection Presents the Antibiotic Susceptibility Test (AST) Browser

Have you ever wanted to compare antibiotic resistance data and resistance gene calls in bacteria? Now you can! Easily access and browse antibiotic susceptibility testing (AST) data and link to other NCBI resources using the new AST Browser. NCBI has collected AST data for many isolates in the Pathogen Detection system.  

Features and Benefits 
  • Data is in a searchable, tabular format 
  • Download data for further analysis 
  • Use the Cross-browser selection tool to link out to the Isolates Browser or MicroBIGG-E to identify the isolates and the genetic elements associated with each AST result 

Continue reading “NCBI Pathogen Detection Presents the Antibiotic Susceptibility Test (AST) Browser”