AlphaFold 2 by RPBS
data:image/s3,"s3://crabby-images/2f0bd/2f0bd492839d213a3f676c0a38c378d4a979b678" alt="RPBS"
data:image/s3,"s3://crabby-images/1cf77/1cf77d6c349499491eecaf24ec2e6ecd4392b588" alt="UPC"
Connection
To use AlphaFold 2 by RPBS connect on the Jupyterhub interface. Use your ipop-up account username and password.
You can register on ipop-up by mail at ipop-up-account-request@rpbs.univ-paris-diderot.fr or on RPBS discourse.
data:image/s3,"s3://crabby-images/2515c/2515cbede6dcb34919a084393f9d23cc10e753dd" alt="Jupyter Hub"
Server Options
data:image/s3,"s3://crabby-images/5bc48/5bc48c50041b7879f704034a4eb75ff498314894" alt="Server"
Here you can define the ressources needed by your job. You can choose the number of CPU, the amount of RAM and the GPU card.
-
To run AlphaFold 2 you need at least 1 GPU card A100, you can choose different slice of the A100 card 1, 2, 4 or 7 GPUs:
- A100 1G20B: ~700 residues
- A100 2G20B: ~150 residues
- A100 4G40B: ~3000 residues
- A100 7G80B: ~4000 residues
-
Concerning the RAM you should reserve at least 20 GB.
- One or two CPU are enough to run the job.
Create and running a Notebook
To create a new notebook click on the Notebook
Section and select Colabfold 1.5.5
Kernel.
- In the first cell type:
from colabfold_jupyter import interface
- Execute the cell by pressing by clicking the
►
button or by typingShift + Enter
.
data:image/s3,"s3://crabby-images/a8ec2/a8ec2226ad44b4f6b6d3390cee1caf977d9aab8f" alt="Run Cell"
- the interface should appear, if not type:
interface.show_widgets()
You can now admire the interface and run AlphaFold 2.
data:image/s3,"s3://crabby-images/76f96/76f96c549fb1886fe6dfea89285b186f2648eaa6" alt="Interface"
Running AlphaFold 2
- To run AlphaFold 2 you need to provide a protein sequence in the
Sequence
field. You can provide multiple sequences separated by a:
.
e.g. for a dimer of beta amyloid:
DAEFRHDSGYEVHHQKLVFFAEDVGSNKGAIIGLMVGGVVIA:DAEFRHDSGYEVHHQKLVFFAEDVGSNKGAIIGLMVGGVVIA
- You can also specify numerous options such as:
- the number of models to generate
- the number of recycles
- Different msa options
- ...
Once you have filled the form, click on the Launch Colabfold
button to start the job.
data:image/s3,"s3://crabby-images/d0860/d0860f72c94eca82ade9ed61f0767d83fdf5d2c2" alt="Start"
Downloading the results
Once the job is finished, you can download the results by clicking on the Download
button.
data:image/s3,"s3://crabby-images/3cdde/3cddec608db7c930450bf161d878cc38ec1be461" alt="Download"
The results are stored in a zip file that you can extract on your computer.
Results are also stored in your project directory on the server (/shared/project/XXX/
).
Analyzing the results
You can analyze the results by using the show_pdb_best()
function.
interface.show_pdb_best(interface.results)
data:image/s3,"s3://crabby-images/886f0/886f074bddd875998a70f8b55cdaf82d930e7980" alt="Show best"
Going further
You can find more information on the AlphaFold 2 website. If you have any questions or need help, you can ask on the RPBS discourse.
AF2 Analysis
We have also developed and integrated the af2_analysis
library to analyze the results of AlphaFold 2 results.
To use it you need to load the library and create a Data
object, using the output directory as input:
import af2_analysis
my_data = af2_analysis.Data(interface.results['dir'])
All computed data are extracted in the my_data.df
pandas DataFrame.
my_data.df
Some additional functions are available to complement the alphafold scores, like the pdockq and pdockq2.
my_data.compute_pdockq()
my_data.compute_pdockq2()
It is possible to plot the plddt scores:
my_data.plot_plddt(range(20))
data:image/s3,"s3://crabby-images/df678/df678d04142d7f897444f637cf9b440e98a45ea4" alt="Plddt"
or the PAE matrix:
my_data.plot_pae(my_data.df['ranking_confidence'].idxmax())
data:image/s3,"s3://crabby-images/49eb9/49eb921d1fa2323122d9a469879056eea653946c" alt="PAE"
Acknowledgements
- Julien Rey and Samuel Murail for the deployment of the AlphaFold 2 server at RPBS.
- Nicolas Chevrollier, Gabriel Tourillon, Gautier Moroy and Pierre Tuffery
- The RPBS platform for computational resources.
- IdEx Université Paris Cité n°ANR-18-IDEX-0001 projet GPU-APBS 2023.
Reference:
-
Mirdita M, Schütze K, Moriwaki Y, Heo L, Ovchinnikov S and Steinegger M. ColabFold: making protein folding accessible to all.
Nature Methods (2022) doi: 10.1038/s41592-022-01488-1 -
Jumper et al. "Highly accurate protein structure prediction with AlphaFold."
Nature (2021) doi: 10.1038/s41586-021-03819-2 -
Evans et al. "Protein complex prediction with AlphaFold-Multimer."
biorxiv (2021) doi: 10.1101/2021.10.04.463034v1