Introduction

The Genome Sequence Archive (GSA) is a data repository for raw sequencing data from genomics, transcriptomics and other omics studies. It archives raw sequence data produced by a wide variety of sequencing platforms. In addition to raw sequencing data, GSA also accepts secondary analysis files in accepted formats (such as BAM and VCF).

GSA is one of the database resources of the BIG Data Center (BIGD), part of the Beijing Institute of Genomics (BIG), Chinese Academy of Sciences (CAS). It serves as a primary archive of genome sequencing data for institutions and laboratories worldwide.

GSA is compatible with the standards and data structures adopted by the existing archives of the International Nucleotide Sequence Database Collaboration. It covers the full spectrum of raw sequencing reads, accepts submissions from all over the world, archives sequencing data and metadata, and makes these data publicly available to scientific communities worldwide.

China Genomic Data Sharing Initiative

Data Accessibility

GSA is an open-access resource freely available to scientific communities throughout the world. All released data in GSA are publicly accessible, whereas access to unreleased data is controlled. Data statistics for GSA are also available.

For more information, please see the GSA Documentation.

How to Cite

Once you have successfully submitted data to GSA, please consider using the following wording to describe the data deposition in your manuscript.

The raw sequence data reported in this paper have been deposited in the Genome Sequence Archive (Genomics, Proteomics & Bioinformatics 2017) in BIG Data Center (Nucleic Acids Res 2017), Beijing Institute of Genomics (BIG), Chinese Academy of Sciences, under accession numbers PRJCAxxxxxx, PRJCAyyyyyy that are publicly accessible at http://bigd.big.ac.cn/gsa.

Please cite the following required publications.

GSA: Genome Sequence Archive. Genomics, Proteomics & Bioinformatics 2017, 15(1): 14-18. doi:10.1016/j.gpb.2017.01.001.
The BIG Data Center: from deposition to integration to translation. Nucleic Acids Res 2017, 45(D1): D18-D24. PMID: 27899658.

News
  • Data deposited in GSA have been reported in a paper published in Mol Biol Evol. (2017-02-17)
  • Data deposited in GSA have been reported in a paper published in Genome Research. (2016-10-21)
  • Data deposited in GSA have been reported in a paper published in AJHG. (2016-09-03)
  • Data deposited in GSA have been reported in a paper published in Current Biology. (2016-08-20)
  • Data deposited in GSA have been reported in a paper published in Stem Cell Reports. (2016-07-14)
  • Data deposited in GSA have been reported in a paper published in Journal of Cell Science. (2016-05-17)
  • GSA (release 1.1) is now available with bug fixes and updates. (2016-01-22)
  • Data deposited in GSA have been reported in a paper published in Cell Research. (2015-12-24)
  • Data deposited in GSA have been reported in a paper published in PNAS. (2015-11-10)

Latest Released Projects

Accession    Date        Description
PRJCA000258  2017-02-09  RNA-seq analysis of flower related genes Rosa chinensis during...
PRJCA000348  2017-02-07  Lactobacillus plantarum HNU082
PRJCA000342  2017-01-20  This research aims to study the influence of bowel cleansing...
PRJCA000333  2017-01-19  We report a genome disorganization study from two patients...
PRJCA000329  2017-01-12  The recent studies suggest that antisense transcripts play...
PRJCA000273  2017-01-06  PAR-CLIP sequencing for YTHDF1 and YTHDF3 in human HeLa cells

Problems or Questions?

Launched in May 2015, GSA is still in its early stages. If you have any questions, suggestions or comments, or would like to report a bug, please feel free to contact us.

Email: gsa@big.ac.cn
QQ group: 548170081

We highly appreciate your comments and suggestions, which help us improve GSA's functionality and provide better services.

Similar Database Links