Multiple Sequence Alignment for Predicting Antigen-Antibody Interactions

Published on ● Video Link: https://www.youtube.com/watch?v=6n6JSRPporI



Duration: 32:36
116 views
3


Kazutaka Katoh (Osaka University)
https://simons.berkeley.edu/talks/multiple-sequence-alignment-predicting-antigen-antibody-interactions
Computational Challenges in Very Large-Scale 'Omics'

A possible direction of development and application of the MAFFT multiple sequence alignment (MSA) program will be discussed. An application we are considering is the prediction of protein-protein interactions. This area has been studied well and recently AlphaFold and AlphaFold-Multimer achieved highly accurate predictions using MSAs as features. However, their well-known weak point is in prediction of antigen-antibody interactions. Our hypothesis is that one of the reasons for the difficulty is that the sequences in an MSA of antibodies have evolutionary relationships greatly different from those of other proteins. Antibodies share a common ancestry, and typically a common framework region. However, they exhibit high diversity in the complementarity-determining regions (CDRs) through recombination and hypermutations in individual cells in an individual organism. Thus, if the sequences are collected by a standard database search, an MSA of antibodies is a mixture of a variety of homologous sequences that bind different binding sites (epitopes) of various antigens. Such an MSA can be noisy for predicting the interaction between an antibody and an antigen. A solution can be to use the InterClone database, being developed in our lab. InterClone clusters antibodies by their CDR sequences and can be used to prepare a set of antibody sequences that are more likely to share an antigen and epitope than sequences gathered by a standard sequence similarity search. We are also planning to experimentally determine the positions of epitopes in several protein antigens to obtain an informative set of antibody-antigen pairs and to expand the training data for AlphaFold or other structure prediction methods.




Other Videos By Simons Institute for the Theory of Computing


2022-07-22New Approaches for Phylogenetic Species Tree Estimation
2022-07-22Single Cell Brain Isoforms in Space and Time
2022-07-22Benchmarking, Inference, and in Silico Controls in Single-Cell and Spatial Omics Data Science
2022-07-22Learning Gene Association Networks Using Single-Cell RNA-Seq Data: A Graphical Model Approach
2022-07-22Mapping Gene Regulatory Dependencies with Single-Cell Resolution
2022-07-22Harnessing Multimodal Single-Cell Sequencing Data for Integrative Analysis
2022-07-22Learning From Large-Scale (Single-Cell) ‘Omics’
2022-07-22Panel Discussion
2022-07-22Exploratory and Model-Based Analysis of ScHi-C Data
2022-07-22The Earth Biogenome Project: Progress and the Challenges Ahead
2022-07-22Multiple Sequence Alignment for Predicting Antigen-Antibody Interactions
2022-07-22Evolution of Germline Mutation Spectrum in Humans
2022-07-22Sequence Bioinformatics at Large Scale: Petabase-Scale Sequence Alignment Catalyses Viral Discovery
2022-07-22Long-Read Transcriptome Complexity and Cell-Type Regulatory Signatures in ENCODE4
2022-07-22Leveraging Long Reads Sequencing for Developing a Functional Iso-Transcriptomics Analysis Framework
2022-07-22Multi-Omic Integration for Understanding Disease
2022-07-22The Epigenetic Logic of Gene Activation
2022-07-22Profiling of Antibody Repertoires and Immunoglobulin Loci Enables Large-Scale Analysis of...
2022-07-22Leveraging Molecular Data for Drug Discovery
2022-07-22The Rewards and Challenges of Constructing Patient Registries in Mexico
2022-07-22Whole Genome Methylation Patterns as a Biomarkers for EHR Imputation



Tags:
Simons Institute
theoretical computer science
UC Berkeley
Computer Science
Theory of Computation
Theory of Computing
Computational Challenges in Very Large-Scale 'Omics'
Kazutaka Katoh