Working directories
1 | # Book of "Protein Structure Prediction" |
Protein Interaction/PPI
- Useful Numbers for Cell Culture | Thermo Fisher Scientific - CN
- Zebra | Sigma-Aldrich
- 结合蛋白预测 - Bing
- 科学网—PPA-Pred—-蛋白互作预测的工具 - 熊朝亮的博文
- 这大概是我最想推荐的蛋白结构预测网站了:Phyre2
- 值得收藏——10个蛋白结构分析预测和相关信号转导网站 - 简书
- How to find out the binding sites in a given protein?
- NCBI Conserved Domain Search
- Welcome to NCBI Domain architecture search
- 1472-6807-14-18.pdf
- Online Analysis Tools - Motifs
- Large-scale prediction of protein-protein interactions from structures
- Frontiers | Large-Scale Protein Interactions Prediction by Multiple Evidence Analysis Associated With an In-Silico Curation Strategy | Bioinformatics
- BSpred On-line Server
- PepBind results
- PepBind results
- bSiteFinder, an improved protein-binding sites prediction server based on structural alignment: more accurate and less time-consuming | Journal of Cheminformatics | Full Text
- 上海海事大学单点登录系统
- protein interaction spectrum prediction - Bing
- IntAct - Search Results
- 寻找结合蛋白 - Bing
- 结合蛋白质谱分析_质谱分析寻找结合蛋白_分析结合蛋白质谱|百泰派克生物科技
- 这四个网站了解一下,让你轻松预测相互作用蛋白! – 王进的个人网站
- GRN - Progranulin precursor - Homo sapiens (Human) - GRN gene & protein
- string - Bing
- NFX1 protein (human) - STRING interaction network
- Useful Numbers for Cell Culture | Thermo Fisher Scientific - CN
- Zebra | Sigma-Aldrich
- 结合蛋白预测 - Bing
- 科学网—PPA-Pred—-蛋白互作预测的工具 - 熊朝亮的博文
- 这大概是我最想推荐的蛋白结构预测网站了:Phyre2
- 值得收藏——10个蛋白结构分析预测和相关信号转导网站 - 简书
- How to find out the binding sites in a given protein?
- NCBI Conserved Domain Search
- Welcome to NCBI Domain architecture search
- 1472-6807-14-18.pdf
- Online Analysis Tools - Motifs
- Large-scale prediction of protein-protein interactions from structures
- Frontiers | Large-Scale Protein Interactions Prediction by Multiple Evidence Analysis Associated With an In-Silico Curation Strategy | Bioinformatics
- BSpred On-line Server
- PepBind results
- PepBind results
- bSiteFinder, an improved protein-binding sites prediction server based on structural alignment: more accurate and less time-consuming | Journal of Cheminformatics | Full Text
- 上海海事大学单点登录系统
- protein interaction spectrum prediction - Bing
Drive5.com
USEARCH >
Extreme high-throughput sequence analysis. Orders of magnitude faster than BLAST.
MUSCLE >
Multiple sequence alignment. Faster and more accurate than CLUSTALW.
UPARSE >
OTU clustering for 16S and other marker genes. Highly accurate OTU sequences and improved diversity measures.
UCHIME >
Chimeric sequence detection.
PILER >
De novo genome repeat finder.
PILER-CR >
Detection of CRISPR repeats in bacterial genomes.
URMAP >> new
Ultra-fast read mapper.
PALS >
Whole-genome alignment.
PREFAB >
Protein Reference Alignment Database.
MSA benchmark collection >
Selected multiple alignment benchmarks in a standardized FASTA format.
AlphaFold Protein Structure Database
Home
Background
AlphaFold is an AI system developed by DeepMind that predicts a protein’s 3D structure from its amino acid sequence. It regularly achieves accuracy competitive with experiment.
DeepMind and EMBL’s European Bioinformatics Institute (EMBL-EBI) have partnered to create AlphaFold DB to make these predictions freely available to the scientific community. The first release covered the human proteome and the proteomes of several other key organisms, while the second release added the majority of manually curated UniProt entries (Swiss-Prot). In 2022 we plan to expand the database to cover a large proportion of all catalogued proteins (the over 100 million in UniRef90).
Q8I3H7: May protect the malaria parasite against attack by the immune system. Mean pLDDT 85.57.
In CASP14, AlphaFold was the top-ranked protein structure prediction method by a large margin, producing predictions with high accuracy. While the system still has some limitations, the CASP results suggest AlphaFold has immediate potential to help us understand the structure of proteins and advance biological research.
Let us know how the AlphaFold Protein Structure Database has been useful in your research, or if you have any questions not covered in the FAQs, at alphafold@deepmind.com.
Q8W3K0: A potential plant disease resistance protein. Mean pLDDT 82.24.
Find out more
MethodologyHuman proteome predictionsDownloads
About AlphaFold DBDeepMindEMBL-EBI
What’s next?
AlphaFold DB will continue to expand in the coming months, so if you can’t find what you’re looking for right now, please follow DeepMind and EMBL-EBI’s social channels for updates. In the meantime the AlphaFold source code and Colab notebook can be used to predict the structures of proteins not yet in AlphaFold DB. Both resources were recently updated to support multimer prediction.
License and attribution
All of the data provided is freely available for both academic and commercial use under Creative Commons Attribution 4.0 (CC-BY 4.0) license terms. If you use this resource, please cite the following papers:
Jumper, J et al. Highly accurate protein structure prediction with AlphaFold. Nature (2021).
Varadi, M et al. AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space with high-accuracy models. Nucleic Acids Research (2021).
The structures provided in this resource are predictions with varying levels of confidence and should be interpreted carefully.
Downloads
AlphaFold DB currently provides predicted structures for the organisms listed below, as well as the majority of Swiss-Prot.
You can download a prediction for an individual UniProt accession by visiting the corresponding structure page (example: https://www.alphafold.ebi.ac.uk/entry/F4HVG8).
For downloading all predictions for a given species, use the download links below. Note that this option is only available on the desktop version of the site.
These uncompressed archive files (.tar) contain all the available compressed PDB and mmCIF files (.gz) for a reference proteome. In the case of proteins longer than 2700 amino acids (aa), AlphaFold provides 1400aa long, overlapping fragments. For example, Titin has predicted fragment structures named as Q8WZ42-F1 (residues 1–1400), Q8WZ42-F2 (residues 201–1600), etc. These fragments are currently only available in these proteome archive files, not on the website.
For downloading all predictions for all species, visit the FTP site: ftp://ftp.ebi.ac.uk/pub/databases/alphafold
Compressed prediction files for model organism proteomes:
Species
Common Name
Reference Proteome
Predicted Structures
Download
Arabidopsis thaliana
Arabidopsis
27,434
Caenorhabditis elegans
Nematode worm
19,694
Candida albicans
C. albicans
5,974
Danio rerio
Zebrafish
24,664
Dictyostelium discoideum
Dictyostelium
12,622
Drosophila melanogaster
Fruit fly
13,458
Escherichia coli
E. coli
4,363
Glycine max
Soybean
55,799
Homo sapiens
Human
23,391
Methanocaldococcus jannaschii
M. jannaschii
1,773
Mus musculus
Mouse
21,615
Oryza sativa
Asian rice
43,649
Rattus norvegicus
Rat
21,272
Saccharomyces cerevisiae
Budding yeast
6,040
Schizosaccharomyces pombe
Fission yeast
5,128
Zea mays
Maize
39,299
Compressed prediction files for global health proteomes:
Species
Common Name
Reference Proteome
Predicted Structures
Download
Ajellomyces capsulatus
Ajellomyces capsulatus
9,199
Brugia malayi
Brugia malayi
8,743
Campylobacter jejuni
C. jejuni
1,620
Cladophialophora carrionii
Cladophialophora carrionii
11,170
Dracunculus medinensis
Dracunculus medinensis
10,834
Enterococcus faecium
Enterococcus faecium
2,823
Fonsecaea pedrosoi
Fonsecaea pedrosoi
12,509
Haemophilus influenzae
H. influenzae
1,662
Helicobacter pylori
H. pylori
1,538
Klebsiella pneumoniae
K. pneumoniae
5,727
Leishmania infantum
L. infantum
7,924
Madurella mycetomatis
Madurella mycetomatis
9,561
Mycobacterium leprae
Mycobacterium leprae
1,602
Mycobacterium tuberculosis
M. tuberculosis
3,988
Mycobacterium ulcerans
Mycobacterium ulcerans
9,033
Neisseria gonorrhoeae
N. gonorrhoeae
2,106
Nocardia brasiliensis
Nocardia brasiliensis
8,372
Onchocerca volvulus
Onchocerca volvulus
12,047
Paracoccidioides lutzii
Paracoccidioides lutzii
8,794
Plasmodium falciparum
P. falciparum
5,187
Pseudomonas aeruginosa
P. aeruginosa
5,556
Salmonella typhimurium
S. typhimurium
4,526
Schistosoma mansoni
Schistosoma mansoni
13,865
Shigella dysenteriae
S. dysenteriae
3,893
Sporothrix schenckii
Sporothrix schenckii
8,652
Staphylococcus aureus
S. aureus
2,888
Streptococcus pneumoniae
S. pneumoniae
2,030
Strongyloides stercoralis
Strongyloides stercoralis
12,613
Trichuris trichiura
Trichuris trichiura
9,564
Trypanosoma brucei
Trypanosoma brucei
8,491
Trypanosoma cruzi
T. cruzi
19,036
Wuchereria bancrofti
Wuchereria bancrofti
12,721
Compressed prediction files for Swiss-Prot:
File type
Predicted Structures
Download
Swiss-Prot (CIF files)
542,380
Swiss-Prot (PDB files)
542,380
License and attribution
Data is available for academic and commercial use, under a CC-BY-4.0 license.
EMBL-EBI expects attribution (e.g. in publications, services or products) for any of its online services, databases or software in accordance with good scientific practice.
If you make use of an AlphaFold prediction, please cite the following papers:
Jumper, J et al. Highly accurate protein structure prediction with AlphaFold. Nature (2021).Varadi, M et al. AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space with high-accuracy models. Nucleic Acids Research (2021).
AlphaFold Data Copyright (2021) DeepMind Technologies Limited.
Feedback
If you want to share feedback on an AlphaFold structure prediction please contact alphafold@deepmind.com. If you have feedback on the website or experience any bugs please contact afdbhelp@ebi.ac.uk.
Disclaimer
The AlphaFold Data and other information provided on this site is for theoretical modelling only, caution should be exercised in its use. It is provided ‘as-is’ without any warranty of any kind, whether expressed or implied. For clarity, no warranty is given that use of the information shall not infringe the rights of any third party. The information is not intended to be a substitute for professional medical advice, diagnosis, or treatment, and does not constitute medical or other professional advice.
Use of the AlphaFold Protein Structure Database is subject to EMBL-EBI Terms of Use.
Protein Structure Prediction
- protein structure prediction - Bing
- Protein structure prediction
- Highly accurate protein structure prediction for the human proteome
- Submit a Prediction Job
- Advances in protein structure prediction and design | Nature Reviews Molecular Cell Biology
- A physical approach to protein structure prediction.
- Home - Prediction Center
- AlphaFold Protein Structure Database
- znfx1 - Zinc finger, NFX1-type-containing 1 - Danio rerio (Zebrafish) - znfx1 gene & protein
- AlphaFold Protein Structure Database