clean-IT: Towards Sustainable Digital Technologiesclean-IT Initiative

Dieses Video gehört zum openHPI-Kurs clean-IT: Towards Sustainable Digital Technologies. Möchten Sie mehr sehen?

Vitor Piro (HPI) - Energy-efficient DNA Sequencing

Zeitaufwand: etwa 9 Minuten

Beim Laden des Videoplayers ist ein Fehler aufgetreten, oder es dauert lange, bis er initialisiert wird. Sie können versuchen, Ihren Browser-Cache zu leeren. Bitte versuchen Sie es später noch einmal und wenden Sie sich an den Helpdesk, wenn das Problem weiterhin besteht.

Über dieses Video


Cheaper, faster and scalable DNA sequencing is causing an exponential growth of genomic data in the tree of life. Large and diverse datasets increase the possibilities of biological research but are difficult to be fully used due to their size and complexity. Ganon is a software to perform approximate DNA-to-DNA matching, that is ganon enables searching very small ‘DNA-needles’ in very large ‘DNA-haystacks’. This is computationally challenging, but becomes feasible applying a specialized probabilistic data structure to process large amounts of DNA sequences, enabling efficient matching. Additionally, ganon can update already indexed data incrementally, meaning that new genome sequences can be incorporated into existing indices in a fraction of time needed to re-index them, drastically reducing its computational cost. Compared to similar approaches, ganon indexes up to 50 times faster and is the only software able to update indices, reducing redundant, energy consuming computations from hours to minutes. More information...

Vitor C. Piro is a postdoctoral researcher at the Hasso-Plattner-Institut in Potsdam, Germany with a PhD in Bioinformatics from the Freie Universität Berlin. He has experience in biological data analysis and scientific software development for bioinformatics, with focus on microbial communities analysis, environmental data, taxonomic classification and metagenomics. Piro is interested in all areas and steps related to microbiome and DNA analysis, but especially in the development of efficient algorithms to bride the fields of computer science, bioinformatics application and data analysis. A summary of his work and publications can be found at Github.