Accountability & fraud: the risks of community data
GenBank receives over 3 million new DNA sequences per month from a large number of contributors, dispersed al over the world.Though not common, erroneous and low-quality and/or even fraudulent data is difficult to identify.
The error rate in GenBank has remained at just 0.1% over many years, even though the amount of sequence data in the database had reached 100Gb by 2005.