IMIS | Flanders Marine Institute
 

Flanders Marine Institute

Platform for marine research

IMIS

Publications | Institutes | Persons | Datasets | Projects | Maps
[ report an error in this record ]basket (0): add | show Print this page

The world bacterial biogeography and biodiversity through databases: A case study of NCBI Nucleotide database and GBIF database
Selama, O.; James, P.; Nateche, F.; Wellington, E.M.H.; Hacène, H. (2013). The world bacterial biogeography and biodiversity through databases: A case study of NCBI Nucleotide database and GBIF database. Biomed. Res. Int. 2013: 1-11. https://hdl.handle.net/10.1155/2013/240175
In: BioMed Research International. Hindawi: New York. ISSN 2314-6133; e-ISSN 2314-6141, more
Peer reviewed article  

Available in  Authors 

Authors  Top 
  • Selama, O.
  • James, P.
  • Nateche, F.
  • Wellington, E.M.H.
  • Hacène, H.

Abstract
    Databases are an essential tool and resource within the field of bioinformatics. The primary aim of this study was to generate an overview of global bacterial biodiversity and biogeography using available data from the two largest public online databases, NCBI Nucleotide and GBIF. The secondary aim was to highlight the contribution each geographic area has to each database. The basis for data analysis of this study was the metadata provided by both databases, mainly, the taxonomy and the geographical area origin of isolation of the microorganism (record). These were directly obtained from GBIF through the online interface, while E-utilities and Python were used in combination with a programmatic web service access to obtain data from the NCBI Nucleotide Database. Results indicate that the American continent, and more specifically the USA, is the top contributor, while Africa and Antarctica are less well represented. This highlights the imbalance of exploration within these areas rather than any reduction in biodiversity. This study describes a novel approach to generating global scale patterns of bacterial biodiversity and biogeography and indicates that the Proteobacteria are the most abundant and widely distributed phylum within both databases.

All data in the Integrated Marine Information System (IMIS) is subject to the VLIZ privacy policy Top | Authors