Introduction

The Genome Sequence Archive (GSA) is a data repository for raw sequencing data from genome, transcriptome, and other omics studies. It archives raw sequence data produced by a wide variety of sequencing platforms. In addition to raw sequencing data, GSA also accepts secondary analysis files in accepted formats (such as BAM and VCF).

GSA is one of the database resources of the BIG Data Center (BIGD), part of the Beijing Institute of Genomics (BIG), Chinese Academy of Sciences (CAS). It serves as a primary archive of genome sequencing data for institutions and laboratories worldwide.

Compatible with the standards and structures adopted by existing archives in the International Nucleotide Sequence Database Collaboration (INSDC), GSA covers the spectrum of raw sequencing reads, accepts submissions from all over the world, archives sequencing data and metadata, and makes these data publicly available to the worldwide scientific community.

China Genomic Data Sharing Initiative

Data Accessibility

GSA is an open-access resource freely available to scientific communities throughout the world. All released data in GSA are publicly accessible, whereas access to unreleased data is controlled. Data statistics for GSA are available on the GSA site.

For more information, please see the GSA Documentation.

How to Cite

Once you have successfully submitted data to GSA, please consider using the following wording to describe the data deposition in your manuscript:

The raw sequence data reported in this paper have been deposited in the Genome Sequence Archive (Genomics, Proteomics & Bioinformatics 2017) in BIG Data Center (Nucleic Acids Res 2017), Beijing Institute of Genomics (BIG), Chinese Academy of Sciences, under accession numbers PRJCAxxxxxx and PRJCAyyyyyy, which are publicly accessible at http://bigd.big.ac.cn/gsa.

Please cite the following required publications.

GSA: Genome Sequence Archive. Genomics, Proteomics & Bioinformatics 2017, 15(1): 14-18. doi:10.1016/j.gpb.2017.01.001.
The BIG Data Center: from deposition to integration to translation. Nucleic Acids Res 2017, 45(D1): D18-D24. PMID:27899658.

News
  • Data deposited to GSA has been reported by a paper published in Cell Research. (2017-05-02)
  • Data deposited to GSA has been reported by a paper published in Mol Biol Evol. (2017-02-17)
  • Data deposited to GSA has been reported by a paper published in Genome Research. (2016-10-21)
  • Data deposited to GSA has been reported by a paper published in AJHG. (2016-09-03)
  • Data deposited to GSA has been reported by a paper published in Current Biology. (2016-08-20)
  • Data deposited to GSA has been reported by a paper published in Stem Cell Reports. (2016-07-14)
  • Data deposited to GSA has been reported by a paper published in Journal of Cell Science. (2016-05-17)
  • GSA (release 1.1) is now available with bug fixes and updates. (2016-01-22)
  • Data deposited to GSA has been reported by a paper published in Cell Research. (2015-12-24)
  • Data deposited to GSA has been reported by a paper published in PNAS. (2015-11-10)
Latest Released Projects

Accession                  Description
PRJCA000315 (2017-04-30)   RNA m5C sequencing in human HeLa cells and mouse tissues
PRJCA000416 (2017-04-11)   A Comparative Transcriptomic Analysis of Uveal Melanoma...
PRJCA000415 (2017-04-06)   Comprehensive simulation of metagenomic sequencing data...
PRJCA000397 (2017-03-24)   o2n-seq
PRJCA000392 (2017-03-24)   EasyMF
PRJCA000378 (2017-03-20)   Droplet-Cirseq

Problems or Questions?

Launched in May 2015, GSA is still in its infancy. If you have any questions, suggestions, or comments, or would like to report a bug, please feel free to contact us.

Email: gsa@big.ac.cn
QQ group: 548170081

We greatly appreciate your comments and suggestions, which help us improve GSA's functionality and provide better services.

Similar Database Links