By analyzing a massive and diverse microbiome dataset, they developed a genomic language model (gLM) that learns to understand the functional “semantics” and regulatory “syntax” of each gene. The gLM model, like large language models, learns through introspection; This means it learns meaningful gene representations from data without the need for human labels.
The researchers found that the gLM model can recognize enzyme functions, co-regulated gene modules, and provide genomic context that can predict gene functions. This allows us to more accurately understand the functions of genes and the relationships between them.
The gLM model represents a significant advance in bioinformatics and machine learning that can accelerate the discovery of new biological mechanisms and improve the understanding of genomic information.
Source: Ferra

I am a professional journalist and content creator with extensive experience writing for news websites. I currently work as an author at Gadget Onus, where I specialize in covering hot news topics. My written pieces have been published on some of the biggest media outlets around the world, including The Guardian and BBC News.