Simulating 500 million years of evolution with a language model. (Record no. 61986)
[ view plain ]
| 000 -LEADER | |
|---|---|
| fixed length control field | 01736nam a2200313Ia 4500 |
| 003 - CONTROL NUMBER IDENTIFIER | |
| control field | MX-MdCICY |
| 005 - DATE AND TIME OF LATEST TRANSACTION | |
| control field | 20251009160708.0 |
| 040 ## - CATALOGING SOURCE | |
| Transcribing agency | CICY |
| 090 ## - LOCALLY ASSIGNED LC-TYPE CALL NUMBER (OCLC); LOCAL CALL NUMBER (RLIN) | |
| Classification number (OCLC) (R) ; Classification number, CALL (RLIN) (NR) | B-21897 |
| 008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION | |
| fixed length control field | 251009s9999 xx 000 0 und d |
| 245 10 - TITLE STATEMENT | |
| Title | Simulating 500 million years of evolution with a language model. |
| 490 0# - SERIES STATEMENT | |
| Series statement | Science, 387(6736), 850-858, 2025. |
| 500 ## - GENERAL NOTE | |
| General note | Artículo |
| 520 3# - SUMMARY, ETC. | |
| Summary, etc. | More than 3 billion years of evolution have produced an image of biology encoded into the space of natural proteins. Here, we show that language models trained at scale on evolutionary data can generate functional proteins that are far away from known proteins. We present ESM3, a frontier multimodal generative language model that reasons over the sequence, structure, and function of proteins. ESM3 can follow complex prompts combining its modalities and is highly responsive to alignment to improve its fidelity. We have prompted ESM3 to generate fluorescent proteins. Among the generations that we synthesized, we found a bright fluorescent protein at a far distance (58% sequence identity) from known fluorescent proteins, which we estimate is equivalent to simulating 500 million years of evolution. |
| 650 14 - SUBJECT ADDED ENTRY--TOPICAL TERM | |
| Topical term or geographic name entry element | COMPUTER SIMULATION |
| 650 14 - SUBJECT ADDED ENTRY--TOPICAL TERM | |
| Topical term or geographic name entry element | EVOLUTION, MOLECULAR |
| 650 14 - SUBJECT ADDED ENTRY--TOPICAL TERM | |
| Topical term or geographic name entry element | LANGUAGE |
| 650 14 - SUBJECT ADDED ENTRY--TOPICAL TERM | |
| Topical term or geographic name entry element | LUMINESCENT PROTEINS |
| 650 14 - SUBJECT ADDED ENTRY--TOPICAL TERM | |
| Topical term or geographic name entry element | SEQUENCE ALIGNMENT |
| 700 12 - ADDED ENTRY--PERSONAL NAME | |
| Personal name | Hayes, T. |
| 700 12 - ADDED ENTRY--PERSONAL NAME | |
| Personal name | Rao, R. |
| 700 12 - ADDED ENTRY--PERSONAL NAME | |
| Personal name | Akin, H. |
| 700 12 - ADDED ENTRY--PERSONAL NAME | |
| Personal name | Sofroniew, N. J. |
| 700 12 - ADDED ENTRY--PERSONAL NAME | |
| Personal name | Oktay, D. |
| 700 12 - ADDED ENTRY--PERSONAL NAME | |
| Personal name | Lin, Z. |
| 700 12 - ADDED ENTRY--PERSONAL NAME | |
| Personal name | Rives, A. |
| 856 40 - ELECTRONIC LOCATION AND ACCESS | |
| Uniform Resource Identifier | <a href="https://drive.google.com/file/d/17VFIWMKQlXGL5mCBizj9KcLEmzSRA2uX/view?usp=drive_link">https://drive.google.com/file/d/17VFIWMKQlXGL5mCBizj9KcLEmzSRA2uX/view?usp=drive_link</a> |
| Public note | Para ver el documento ingresa a Google con tu cuenta: @cicy.edu.mx |
| 942 ## - ADDED ENTRY ELEMENTS (KOHA) | |
| Source of classification or shelving scheme | Clasificación local |
| Koha item type | Documentos solicitados |
| Lost status | Source of classification or shelving scheme | Damaged status | Not for loan | Collection | Home library | Current library | Shelving location | Date acquired | Total checkouts | Full call number | Date last seen | Price effective from | Koha item type |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Clasificación local | Ref1 | CICY | CICY | Documento préstamo interbibliotecario | 09.10.2025 | B-21897 | 09.10.2025 | 09.10.2025 | Documentos solicitados |
