https://www.biorxiv.org/content/10.1101/2023.09.24.559168v1
GET: a foundation model of transcription across human cell types
Transcriptional regulation, involving the complex interplay between regulatory sequences and proteins, directs all biological processes. Computational models of transcriptions lack generalizability to accurately extrapolate in unseen cell types and conditi
www.biorxiv.org
Summary
The document titled "GET: a foundation model of transcription across human cell types" introduces the General Expression Transformer (GET), an innovative computational model designed to understand transcriptional regulation across a broad spectrum of human fetal and adult cell types. Here are the key points summarized from the document:
Key Findings
- Model Overview: GET is a foundation model that leverages chromatin accessibility data and sequence information to learn transcriptional regulatory syntax, allowing it to predict gene expression across both seen and unseen cell types with high accuracy.
- Comprehensive Data Utilization: The model uses data from about 213 human cell types, combining single-cell ATAC-seq (for chromatin accessibility) and RNA-seq data to train and fine-tune its predictions.
- Advanced Predictive Abilities: GET showcases exceptional adaptability and accuracy in gene expression prediction across new cell types and conditions. It outperforms existing models in identifying cis-regulatory elements and upstream regulators, as well as in the prediction of lentivirus-based massive parallel reporter assay readouts.
- Interpretable Insights: By interpreting GET, researchers can uncover detailed regulatory insights and transcription factor interactions, providing valuable information for nearly every gene in the studied cell types.
- Practical Applications: The model's capabilities extend to identifying distal regulatory regions missed by other models, understanding specific transcription factor interactions related to diseases like lymphoma, and designing synthetic biology applications.
- Open Access and Integration: The GET model, along with its training and application data, is made available to the scientific community, ensuring that it can be used as a powerful tool in further research.
Overall, the GET model represents a significant advance in computational biology, offering a robust and versatile tool for exploring the complexities of gene regulation across a wide array of human cell types.

- Transcriptional regulation
- Gene expression
- Computational model
- Foundation model
- Human cell types
- Chromatin accessibility
- RNA sequencing (RNA-seq)
- Single-cell ATAC-seq
- Gene regulation
- Transcription factors
- Machine learning
- Predictive modeling
- Regulatory elements
'Journal Club' 카테고리의 다른 글
| [D+1] [240429] "Bento: a toolkit for subcellular analysis of spatial transcriptomics data" (0) | 2024.05.01 |
|---|