Accountability & fraud: the risks of community data

GenBank receives over 3 million new DNA sequences per month from a large number of contributors dispersed all over the world.

Though uncommon, erroneous, low-quality, or even fraudulent data is difficult to identify.

The error rate in GenBank has remained at just 0.1% over many years, even as the amount of sequence data in the database grew to 100 Gb by 2005.

You must be ready to personally confirm ALL data!