About RCSB PDB: Enabling Breakthroughs in Scientific and Biomedical Research and Education


The Protein Data Bank (PDB) was established as the 1st open access digital data resource in all of biology and medicine (Historical Timeline). It is today a leading global resource for experimental data central to scientific discovery.

Through an internet information portal and downloadable data archive, the PDB provides access to 3D structure data for large biological molecules (proteins, DNA, and RNA). These are the molecules of life, found in all organisms on the planet.

Knowing the 3D structure of a biological macromolecule is essential for understanding its role in human and animal health and disease, its function in plants and food and energy production, and its importance to other topics related to global prosperity and sustainability.

RCSB PDB operates the US data center for the global PDB archive, and makes PDB data available at no charge to all data consumers without limitations on usage (Policies).

The Vision of the RCSB PDB is to enable open access to the accumulating knowledge of 3D structure, function, and evolution of biological macromolecules, expanding the frontiers of fundamental biology, biomedicine, and biotechnology.

Recognized experts in fields, including but not limited to, structural biology, cell and molecular biology, computational biology, information technology, and education serve as advisors to the RCSB PDB.

PDB Archive contains >1 TB of Structure Data for Proteins, DNA, and RNA

The cost to replicate the contents of the PDB archive is estimated at
$14 billion (Analysis)

The PDB Archive

  • Grows at the rate of nearly 10% per year
  • Used to download >1.8 Million structure data files per day
  • Managed by International collaboration US-Asia-Europe
  • Manages “Big Data” as global Public Good

PDB Data Impact

  • Basic and applied research
  • Patent applications
  • Discovery of lifesaving drugs
  • Innovations that can lead to new product development and company formation
  • STEM education: PDB-101 provides curricula and online tools for teachers and students

>1,000,000 Data Consumers worldwide served every year

Researchers, scientists, educators, students, curious public, medical professionals, patients, and patient advocates

Public and Private sectors, including pharmaceutical and biotechnology companies

Generates return on investment of
~1,500 times federal funding (Analysis)


Download Printable PDF




Supporting Access to the Biological Molecules of the PDB Archive

  • Deposition/Biocuration Services support Data Depositors who deposit the results of their structural studies of biological macromolecules to the PDB. All data deposited undergo expert review. Each structure is examined for self-consistency, standardized using controlled vocabularies, cross-referenced with other biological data resources, and validated for scientific/technical accuracy.
  • Archive Management/Access Services support PDB Data Consumers by maintaining the PDB archive; data dictionary development and standardization, enabling global data delivery and DOI registration, and integrating PDB data with other available information.
  • Data Exploration Services support PDB Data Consumers in the US and around the world through our open-access web portal RCSB.org that provides tools for structure visualization and analysis.
  • Outreach/Education Services for teachers, students, and the general public are primarily delivered via our PDB-101 website (“101", as in an entry-level course).



Funding



Users and Impact

RCSB PDB supports an international community of users, including biologists (in fields such as structural biology, biochemistry, genetics, pharmacology); other scientists (in fields such as bioinformatics, software developers for data analysis and visualization); students and educators (all levels); media writers, illustrators, textbook authors; and the general public.

RCSB PDB services have broad impact across research and education. The inaugural RCSB PDB citation (Berman et al., Nucleic Acids Research 2000) is one of the top-cited scientific publications of all time. A 2017 bibliometric analysis performed by Clarivate Analytics shows PDB motivated high-quality research throughout the world. Papers citing had a citation-based impact exceeding the world-average in 16 scientific fields including Biology & Biochemistry, Computer Science, Plant & Animal Sciences, Physics, Environment/Ecology, Mathematics and Geosciences.

A 2017 economic analysis performed by the Rutgers Office of Research Analytics noted that a reasonable estimate to replicate the PDB data archive at the time was $12 billion.

  • Supporting the NSF Big Ideas (PDF)
  • Supporting NIH in Medical Research (PDF)
  • Supporting the Research Goals of DOE (PDF)

Collaborations

Worldwide Protein Data Bank (wwPDB)

The Worldwide Protein Data Bank (wwPDB) was formed to maintain a single PDB archive of macromolecular structural data that is freely and publicly available to the global community. It consists of organizations that act as deposition, data processing and distribution centers for PDB data. As the US Data Center, RCSB PDB biocurates structures submitted from the Americas and Oceania.

EMDataBank

EMDataBank provides access to 3DEM density maps and metadata, news, events, software tools, data standards, and validation methods.

Nucleic Acid Database

NDB contains information about experimentally-determined nucleic acids and complex assemblies.