AI-based Protein 3D Prediction using AlphaFold

Main Article Content

Alejandro Hernández-Soto
Alexander Barquero

Abstract

Three-dimensional protein prediction with a high approximation to the actual conformation is conceptually possible using mathematical models according to the Anfinsen dogma. Still, it is impractical due to the multiple conformations that add complexity due to the Levinthal paradox. One way to solve this puzzle is by using machine learning models based on structures that have already been elucidated using artificial intelligence. The AlphaFold program allows de novo predictions by using machine learning algorithms. This paper explains the tool’s components and the metrics used to interpret the results. It provides optional ways to access the program, along with a practical example to learn how to execute a prediction. It concludes with the principles of use and ethics of the tool.

Article Details

How to Cite
Hernández-Soto , A., & Barquero , A. (2026). AI-based Protein 3D Prediction using AlphaFold. Tecnología En Marcha Journal, 39(5), Pág. 203–221. https://doi.org/10.18845/tm.v39i5.8492
Section
Aplicaciones científicas y ambientales de la IA

References

[1] C. B. Anfinsen, “Principles that govern the folding of protein chains,” Science, vol. 181, no. 4096, pp. 223–230, 1973, doi: 10.1126/science.181.4096.223.

[2] R. Zwanzig, A. Szabo, and B. Bagchi, “Levinthal’s paradox,” Proc. Natl. Acad. Sci. USA, vol. 89, no. 1, pp. 20–22, 1992, doi: 10.1073/pnas.89.1.20.

[3] J. Jumper et al., “Highly accurate protein structure prediction with AlphaFold,” Nature, vol. 596, no. 7873, pp. 583-589, 2021, doi: 10.1038/s41586-021-03819-2.

[4] J. Abramson et al., “Accurate structure prediction of biomolecular interactions with AlphaFold 3,” Nature, vol. 630, no. 8016, pp. 493–500, 2024, doi: 10.1038/s41586-024-07487-w.

[5] U. Göbel, C. Sander, R. Schneider, and A. Valencia, “Correlated mutations and residue contacts in proteins,” Proteins, vol. 18, no. 4, pp. 309–317, 1994, doi: 10.1002/prot.340180402.

[6] S. A. Benner and D. Gerloff, “Patterns of divergence in homologous proteins as indicators of secondary and tertiary structure,” Adv. Enzyme Regul., vol. 31, pp. 121–181, 1991, doi: 10.1016/0065-2571(91)90012-B.

[7] W. R. Taylor and K. Hatrick, “Compensating changes in protein multiple sequence alignments,” Protein Eng. Des. Sel., vol. 7, no. 3, pp. 341–348, 1994, doi: 10.1093/protein/7.3.341.

[8] V. Mariani, M. Biasini, A. Barbato, and T. Schwede, “lDDT: A local superposition-free score for comparing protein structures and models,” Bioinformatics, vol. 29, no. 21, pp. 2722–2728, 2013, doi: 10.1093/bioinformatics/btt473.

[9] H.-B. Guo et al., “AlphaFold2 models indicate that protein sequence determines both structure and dynamics,” Sci. Rep., vol. 12, p. 10696, 2022, doi: 10.1038/s41598-022-14382-9.

[10] G. Kim et al., “Easy and accurate protein structure prediction using ColabFold,” 2023. [Online]. Available: https://colabfold.mmseqs.com

[11] R. Krishna et al., “Generalized biomolecular modeling and design with RoseTTAFold All-Atom,” Science, vol. 384, no. 6693, p. eadl2528, 2024, doi: 10.1126/science.adl2528.

[12] M. van Kempen et al., “Fast and accurate protein structure search with Foldseek,” Nat. Biotechnol., vol. 42, no. 2, pp. 243–246, 2024, doi: 10.1038/s41587-023-01773-0.

[13] M. Varadi et al., “AlphaFold Protein Structure Database in 2024: providing structure coverage for over 214 million protein sequences,” Nucleic Acids Res., vol. 52, no. D1, pp. D368–D375, 2024, doi: 10.1093/nar/gkad1011.

[14] B. T. Bajar et al., “Improving brightness and photostability of green and red fluorescent proteins for live cell imaging,” Sci. Rep., vol. 6, p. 20889, 2016, doi: 10.1038/srep20889.

[15] S. Papadaki et al., “Dual-expression system for blue fluorescent protein optimization,” Sci. Rep., vol. 12, p. 13214, 2022, doi: 10.1038/s41598-022-13214-0.

[16] T. M. Wannier et al., “Monomerization of far-red fluorescent proteins,” Proc. Natl. Acad. Sci. USA, vol. 115, no. 48, pp. E11294–E11301, 2018, doi: 10.1073/pnas.1807449115.

[17] J. Lee et al., “Versatile phenotype-activated cell sorting,” Sci. Adv., vol. 6, no. 43, p. eabb7438, 2020, doi: 10.1126/sciadv.abb7438.

[18] J. L. Watson et al., “De novo design of protein structure and function with RFdiffusion,” Nature, vol. 620, no. 7976, pp. 1089–1100, 2023, doi: 10.1038/s41586-023-06415-8.

[19] S. Vázquez Torres et al., “De novo designed proteins neutralize lethal snake venom toxins,” Nature, vol. 639, no. 8053, pp. 225–231, 2025, doi: 10.1038/s41586-024-08393-x.

[20] S. Tang et al., “De novo gene synthesis by an antiviral reverse transcriptase,” Science, vol. 386, no. 6717, pp. 1–41, 2024, doi: 10.1126/science.adq0876.