Leveraging Long Reads Sequencing for Developing a Functional Iso-Transcriptomics Analysis Framework

Published on ● Video Link: https://www.youtube.com/watch?v=BzmPIfdmuL0



Duration: 35:55
36 views
3


Ana Conesa (Spanish National Research Council)
https://simons.berkeley.edu/talks/leveraging-long-reads-sequencing-develop-functional-iso-transcriptomics-analysis-framework
Computational Challenges in Very Large-Scale 'Omics'

Post-transcriptional mechanisms such as Alternative Splicing (AS) and Alternative PolyAdenylation (APA) regulate the maturation of pre-mRNAs and may result in different transcripts arising from the same gene, increasing the diversity and regulation capacity of transcriptomes and proteomes. AS and APA has been extensively characterized at the mechanistic levels but to a lesser extent in terms of functional impact. While functional profiling is widely used to characterize the functional relevance of gene expression at the genome-wide level, similar tools at isoform resolution are missing. In contrast to short reads, single molecular sequencing technologies allow for direct sequencing of full-length transcripts, and novel tools are needed to leverage the information potential of these platforms to study the functional consequences of alternative transcript processing. Particularly, RNA sequencing using long reads technologies results in a vast number of novel transcripts that are a mixture of representations of true molecules and technology artifacts. Additionally, functional annotation at isoform resolution has not been developed yet. Here we present a novel computational framework for Functional Iso-Transcriptomics analysis (FIT), specially designed to study isoform (differential) expression from a functional perspective. This framework consists of three bioinformatics developments. SQANTI is used to define and curate expressed transcriptomes obtained with long-read technologies. SQANTI categorizes full-length reads, evaluates their potential biases, and removes low-quality instances. The IsoAnnot pipeline combines multiple databases and function prediction algorithms to return a rich isoform-level annotation file of functional domains, motifs, and sites, both coding and non-coding. Finally, the tappAS software introduces novel analysis methods to interrogate the functional relevance of isoform complexity. I will show the application of the FIT framework to the analysis of differentiating mouse neural cells.




Other Videos By Simons Institute for the Theory of Computing


2022-07-22Mapping Gene Regulatory Dependencies with Single-Cell Resolution
2022-07-22Harnessing Multimodal Single-Cell Sequencing Data for Integrative Analysis
2022-07-22Learning From Large-Scale (Single-Cell) ‘Omics’
2022-07-22Panel Discussion
2022-07-22Exploratory and Model-Based Analysis of ScHi-C Data
2022-07-22The Earth Biogenome Project: Progress and the Challenges Ahead
2022-07-22Multiple Sequence Alignment for Predicting Antigen-Antibody Interactions
2022-07-22Evolution of Germline Mutation Spectrum in Humans
2022-07-22Sequence Bioinformatics at Large Scale: Petabase-Scale Sequence Alignment Catalyses Viral Discovery
2022-07-22Long-Read Transcriptome Complexity and Cell-Type Regulatory Signatures in ENCODE4
2022-07-22Leveraging Long Reads Sequencing for Developing a Functional Iso-Transcriptomics Analysis Framework
2022-07-22Multi-Omic Integration for Understanding Disease
2022-07-22The Epigenetic Logic of Gene Activation
2022-07-22Profiling of Antibody Repertoires and Immunoglobulin Loci Enables Large-Scale Analysis of...
2022-07-22Leveraging Molecular Data for Drug Discovery
2022-07-22The Rewards and Challenges of Constructing Patient Registries in Mexico
2022-07-22Whole Genome Methylation Patterns as a Biomarkers for EHR Imputation
2022-07-22Biological Discovery and Consumer Genomics Databases Activate Latent Privacy Risk in...
2022-07-22How Do We Deliver Precision Health at Scale for All?
2022-07-22Longitudinal Phenotypes and Disease Trajectories at Population Scale
2022-07-22Towards Making Identification of Noncoding Causes of Human Disease Routine



Tags:
Simons Institute
theoretical computer science
UC Berkeley
Computer Science
Theory of Computation
Theory of Computing
Computational Challenges in Very Large-Scale 'Omics'
Ana Conesa