Leveraging Long Reads Sequencing for Developing a Functional Iso-Transcriptomics Analysis Framework

Channel:

Simons Institute for the Theory of Computing

Subscribers:

68,700

Published on July 22, 2022 6:03:24 PM ● Video Link: https://www.youtube.com/watch?v=BzmPIfdmuL0

Duration: 35:55

36 views

Ana Conesa (Spanish National Research Council)
https://simons.berkeley.edu/talks/leveraging-long-reads-sequencing-develop-functional-iso-transcriptomics-analysis-framework
Computational Challenges in Very Large-Scale 'Omics'

Post-transcriptional mechanisms such as Alternative Splicing (AS) and Alternative PolyAdenylation (APA) regulate the maturation of pre-mRNAs and may result in different transcripts arising from the same gene, increasing the diversity and regulation capacity of transcriptomes and proteomes. AS and APA has been extensively characterized at the mechanistic levels but to a lesser extent in terms of functional impact. While functional profiling is widely used to characterize the functional relevance of gene expression at the genome-wide level, similar tools at isoform resolution are missing. In contrast to short reads, single molecular sequencing technologies allow for direct sequencing of full-length transcripts, and novel tools are needed to leverage the information potential of these platforms to study the functional consequences of alternative transcript processing. Particularly, RNA sequencing using long reads technologies results in a vast number of novel transcripts that are a mixture of representations of true molecules and technology artifacts. Additionally, functional annotation at isoform resolution has not been developed yet. Here we present a novel computational framework for Functional Iso-Transcriptomics analysis (FIT), specially designed to study isoform (differential) expression from a functional perspective. This framework consists of three bioinformatics developments. SQANTI is used to define and curate expressed transcriptomes obtained with long-read technologies. SQANTI categorizes full-length reads, evaluates their potential biases, and removes low-quality instances. The IsoAnnot pipeline combines multiple databases and function prediction algorithms to return a rich isoform-level annotation file of functional domains, motifs, and sites, both coding and non-coding. Finally, the tappAS software introduces novel analysis methods to interrogate the functional relevance of isoform complexity. I will show the application of the FIT framework to the analysis of differentiating mouse neural cells.

Other Videos By Simons Institute for the Theory of Computing

2022-07-22	Mapping Gene Regulatory Dependencies with Single-Cell Resolution
2022-07-22	Harnessing Multimodal Single-Cell Sequencing Data for Integrative Analysis
2022-07-22	Learning From Large-Scale (Single-Cell) ‘Omics’
2022-07-22	Panel Discussion
2022-07-22	Exploratory and Model-Based Analysis of ScHi-C Data
2022-07-22	The Earth Biogenome Project: Progress and the Challenges Ahead
2022-07-22	Multiple Sequence Alignment for Predicting Antigen-Antibody Interactions
2022-07-22	Evolution of Germline Mutation Spectrum in Humans
2022-07-22	Sequence Bioinformatics at Large Scale: Petabase-Scale Sequence Alignment Catalyses Viral Discovery
2022-07-22	Long-Read Transcriptome Complexity and Cell-Type Regulatory Signatures in ENCODE4
2022-07-22	Leveraging Long Reads Sequencing for Developing a Functional Iso-Transcriptomics Analysis Framework
2022-07-22	Multi-Omic Integration for Understanding Disease
2022-07-22	The Epigenetic Logic of Gene Activation
2022-07-22	Profiling of Antibody Repertoires and Immunoglobulin Loci Enables Large-Scale Analysis of...
2022-07-22	Leveraging Molecular Data for Drug Discovery
2022-07-22	The Rewards and Challenges of Constructing Patient Registries in Mexico
2022-07-22	Whole Genome Methylation Patterns as a Biomarkers for EHR Imputation
2022-07-22	Biological Discovery and Consumer Genomics Databases Activate Latent Privacy Risk in...
2022-07-22	How Do We Deliver Precision Health at Scale for All?
2022-07-22	Longitudinal Phenotypes and Disease Trajectories at Population Scale
2022-07-22	Towards Making Identification of Noncoding Causes of Human Disease Routine

Tags:

Simons Institute

theoretical computer science

UC Berkeley

Computer Science

Theory of Computation

Theory of Computing

Computational Challenges in Very Large-Scale 'Omics'

Ana Conesa

Channel	Latest
S-Tavo Plays	6 hours ago
Winkazi	6 hours ago
smskcntr	6 hours ago
AhtmosTV	6 hours ago
ScarletMarisa375	6 hours ago
OtakuPT	6 hours ago
Koragg Wolzard WolfThunderRangerKilleranger34*	6 hours ago
Insert Coin	7 hours ago
Crainer	7 hours ago
Overdrive	7 hours ago
Game Guides Channel	7 hours ago
Sveneta	7 hours ago
ImpulseDm	7 hours ago
GrizzoUK	7 hours ago
Outstanding Gameplays	7 hours ago
DiDi	7 hours ago
Twin Style	7 hours ago
Zeol VTuber	7 hours ago
AH Brandon Reviews	7 hours ago
Hiria Games	7 hours ago
TimmyTurnersGrandDad	7 hours ago
SlyfoXlive	7 hours ago
Trebor	8 hours ago
Carter300	8 hours ago
The Head Buddies	8 hours ago