eScience Workshop 2005 - A Platform for Computational Comparative Genomics on the Web

Subscribers:
349,000
Published on ● Video Link: https://www.youtube.com/watch?v=rd4h6k7IfiQ



Duration: 23:01
278 views
2


We have been developing a Web-based system for comparing multiple genomes, PLATCOM, where users can choose genomes and perform analysis of the selected genomes with a suite of computational tools. PLATCOM is built on internal databases such as GenBank, COG, KEGG, and Pairwise Comparison Database (PCDB) that contains all pairwise comparisons (97,034 entries) of protein sequence files (.faa) and whole genome sequence files (.fna) of 312 replicons. The pre-computed PCDB makes it possible to complete genome analysis very fast even on the web, so that users can choose any combination of genomes and analyze them with data mining tools. Genome comparison requires combining many sequence analysis tools. However, combining multiple tools for sequence analysis requires a significant amount of programming work and knowledge on each tool, thus it is very challenging to provide a service for comparing genomes on the web by using standard sequence analysis tools. Thus, to make genome comparison be done on the web, well-defined data mining concept and tools are very important since they can make genome comparison much easier. It is also important that the data mining tools for genome comparison should be scalable. We have been developing such scalable tools: a sequence clustering algorithm BAG, a metabolic pathway analysis tool MetaPath, a gene fusion event detection tool FuzFinder, a gene neighborhood navigation tool OperonViz, an algorithm for mining correlated gene sets MCGS, a genome sequence alignment tool GAME, a multiple genome sequence alignment algorithm by clustering local matches mgAlign, and a pairwise genome visulization tool COMPAM. The analysis results are summarized with visualization tools. We are currently working on integrating the data mining modules such that users can combine these in a very flexible way. In addition to sequence data, PLATCOM will include more data types such as gene expression data.




Other Videos By Microsoft Research


2016-09-07SCS '06 - Lightning Round 2: Mobile/Pervasive Social Computing - Talk 3
2016-09-07SCS '06 - Lightning Round 1: Learning in and about Virtual Worlds - Talk 6
2016-09-07SCS '06 - Lightning Round 2: Mobile/Pervasive Social Computing - Talk 1
2016-09-07eScience Workshop 2005 - Environmental Science from Satellites
2016-09-07SANGAM: A System for Integrating Web Services to Investigate Stress-Circuitry-Gene Coupling
2016-09-07SCS '06 - Lightning Round 1: Learning in and about Virtual Worlds - Talk 4
2016-09-07eScience Workshop 2005 - Creating the Personal Supercomputer
2016-09-07eScience Workshop 2005 - WorldWind
2016-09-07WACE 2005 - Keynote
2016-09-07Computationally-intensive biomedical research projects supported by National Institutes of Health
2016-09-07eScience Workshop 2005 - A Platform for Computational Comparative Genomics on the Web
2016-09-07Using .NET and Web Services to build an e-Science Application: Looking for White Dwarfs
2016-09-07eScience Workshop 2005 - Integration and Visualization in Bioinformatics
2016-09-07Streamlining Scientific Research via Electronic Laboratory Notebooks and Wireless Sensors
2016-09-07eScience Workshop 2005 - Computational Data Grid for Scientific and Biomedical Applications
2016-09-07eScience Workshop 2005 - Making NEXRAD Precipitation Data Available to the Hydrology Community
2016-09-07eScience Workshop 2005 - The WiFi eTransit Village
2016-09-07Microsoft Research Faculty Summit 2005 — Future of Scientific Computing Panel
2016-09-07WACE 2005 - SID Grid: Collaborative Experimentation in a Sensor-Rich Laboratory
2016-09-07eScience Workshop 2005 - Web Services for HPC ΓÇö Making Seamless Computing a Reality
2016-09-07WACE 2005 - Collaboration in Directly Mediated Interaction Environments



Tags:
microsoft research