May 20 2024Insilico Medicine In a new paper, researchers from clinical stage artificial intelligence -driven drug discovery company Insilico Medicine , in collaboration with NVIDIA, present a new large language model transformer for solving biological and chemical tasks called nach0.
Nach0 seeks to bridge this gap for the first time. It draws from a dataset that includes abstract texts extracted from PubMed and patent descriptions derived from the U.S. Patent and Trademark Office related to the chemistry domain – 100 million documents that became 355 million tokens worth of abstracts and 2.9 billion patents, as well as molecular structures using simplified molecular-input line-entry system .
Nach0 represents a step forward in automating drug discovery through natural language prompts. In the future, we foresee the potential inclusion of protein sequences with their own special tokens as well as fine-tuning the model in order to accommodate new modalities and exploring the fusion of information from text and knowledge graphs." Nach0 is built on the NVIDIA BioNeMo generative AI platform, enabling training and scaling of drug discovery applications.
Related StoriesMeasured against other LLMs used for biomedical understanding, such as FLAN, SciFive, and MolT5, nach0 was found to have distinct advantages when performing molecular tasks using molecular data, and it significantly outperformed ChatGPT.
Technology Technology Latest News, Technology Technology Headlines
Similar News:You can also read news stories similar to this one that we have collected from other news sources.
Source: NewsMedical - 🏆 19. / 71 Read more »
Source: pcgamer - 🏆 38. / 67 Read more »
Source: CreativeBloq - 🏆 40. / 65 Read more »