boston 1

Keynotes

Shawn Murphy, Partners Healthcare

Title: Instrumenting the Health Care Enterprise for Discovery in the Course of Clinical Care
Abstract: Although patients may have a wealth of imaging, genomic, monitoring, and personal device data, it has yet to be fully integrated into clinical care. We identify three reasons for the lack of integration. The first is that “Big Data” is poorly managed by most Electronic Medical Record Systems (EMRS). The data is mostly available on “cloud-native” platforms that are outside the scope of most EMRS, and even checking if such data is available on a patient often must be done outside the EMRS. The second reason is that extracting features from the Big Data that are relevant to healthcare often requires complex machine learning algorithms, such as determining if a genomic variant is protein-altering. The third reason is that applications that present the big data need to be modified constantly to reflect the current state of knowledge, such as instructing when to order a new set of genomic tests. In some cases, the applications need to be updated nightly. A new architecture for the EMRS is evolving which could unite Big Data, machine learning, and clinical care through a microservice-based architecture which can host applications focused on quite specific aspects of clinical care, such as managing cancer immunotherapy. Informatics innovation, medical research, and clinical care go hand in hand as we look to infuse science-based practice into healthcare. Innovative methods will lead to in a new ecosystem of Apps interacting with healthcare providers to fulfill a promise that is still to be determined.

Biography: Dr. Murphy is the Corporate Director of Research Informatics and Computing at Partners HealthCare, is an Associate Professor of Neurology and Biomedical Informatics at Harvard Medical School, and serves as Associate Director for the Laboratory of Computer Science at the Massachusetts General Hospital. He received his BS in Chemistry from the University of Notre Dame, and his Ph.D. in Pharmacology and Physiology and MD from the University of Chicago. Dr. Murphy’s research interests include the creation of query methods for healthcare data that enable them to be directly used by scientists even when the data is extremely large. Dr. Murphy has created high impact and widely accepted open source projects that focus on the integration of highly diverse phenotypic, imaging, and genomic data such that new discoveries can be visualized and produced from secondary use of routinely collected healthcare data and be applied to new methods of providing clinical decision support in a learning healthcare system.

Dagmar Ringe, Brandeis University

Title: Challenges to Drug Design
Abstract: There are a number of problems to which these methods are being applied, and they include: the structural basis for efficient enzyme catalysis of proton and hydride transfer; the role of the metal ions in bridged bimetallo-enzyme active sites; direct visualization of proteins in action by time-resolved protein crystallography; the structural basis for reaction selectivity in enzymes; the evolution of new enzyme activities from old ones and the development of new pathways. The most important of these problems is the study of neurodegenerative diseases, especially Parkinson’s, Alzheimer’s and Lou Gehrig’s diseases, with a view toward determining the functions of gene products associated with such diseases, finding common pathways that underlie them, and the search for and design of specific inhibitors or activators as potential drugs to treat these diseases. A number of methods are being used, from development of genetic and in vitro screens to structure-based drug design methods.

Biography: Dr. Ringe received her B.A. degree in Chemistry from Barnard College in 1963 and Ph.D. degree in Organic Chemistry from Boston University in 1968. She is currently the Harold and Bernice Davis Professor of Aging and Neurodegenerative Diseases at Brandeis University, and Adjunct Professor Neurology at Harvard Medical School. Her research is aimed at the study of the molecular mechanisms of enzymes and the functions of proteins, with a focus on the determination of function for proteins of unknown function and the development of methods for intervention on that function. The tools being used include X-ray crystallography, molecular biology, kinetics, organic synthesis, genetics, and computational approaches.

There are a number of problems to which these methods are being applied, and they include: the structural basis for efficient enzyme catalysis of proton and hydride transfer; the role of the metal ions in bridged bimetallo-enzyme active sites; direct visualization of proteins in action by time-resolved protein crystallography; the structural basis for reaction selectivity in enzymes; the evolution of new enzyme activities from old ones and the development of new pathways. The most important of these problems is the study of neurodegenerative diseases, especially Parkinson’s, Alzheimer’s and Lou Gehrig’s diseases, with a view toward determining the functions of gene products associated with such diseases, finding common pathways that underlie them, and the search for and design of specific inhibitors or activators as potential drugs to treat these diseases. A number of methods are being used, from development of genetic and in vitro screens to structure-based drug design methods.

Tandy Warnow, The University of Illinois at Urbana-Champaign

Title: Genome-scale estimation of the Tree of Life
Abstract: Estimating the Tree of Life is one of the grand computational challenges in Science, and has applications to many areas of science and biomedical research. Despite intensive research over the last several decades, many problems remain inadequately solved. In this talk I will discuss species tree estimation from genome-scale datasets. I will describe the current state of the art for these problems, what is understood about these problems from a mathematical perspective, and identify some of the open problems in this area where mathematical research, drawing from graph theory, combinatorial optimization, and probability and statistics, is needed. This talk will be accessible to mathematicians, computer scientists, probabilists and statisticians, and does not require any knowledge of biology.

Biography: Tandy Warnow is the Founder Professor of Engineering at the University of Illinois at Urbana-Champaign, where she has a dual appointment between Computer Science and Bioengineering. She is also a member of the Carl R. Woese Institute for Genomic Biology and an affiliate in six other departments at UIUC (Statistics, Mathematics, Electrical and Computer Engineering, Plant Biology, Animal Biology, and Entomology). Tandy received her PhD in Mathematics at UC Berkeley under the direction of Gene Lawler, and did postdoctoral training with Simon Tavar´e and Michael Waterman at the University of Southern California. She received the National Science Foundation Young Investigator Award in 1994, the David and Lucile Packard Foundation Award in Science and Engineering in 1996, an Emeline Bigelow Conland Fellowship at the Radcliffe Institute for Advanced Study in 2006, and a Guggenheim Foundation Fellowship for 2011. In 2016 she was elected as an ACM Fellow, and in 2017 she was elected as a ISCB Fellow. Her research combines mathematics, computer science, and statistics to develop improved models and algorithms for reconstructing complex and large-scale evolutionary histories in both biology and historical linguistics. Her current research focuses on phylogeny and alignment estimation for very large datasets (10,000 to 1,000,000 sequences), estimating species trees from collections of gene trees, and metagenomics.

Special Invited Talk

Enoch Huang, Pfizer

Title: Demystifying Careers in the Biopharmaceutical Industry
Abstract: Come hear an insider perspective on careers in the biopharmaceutical sector: navigating the hiring process, what roles, skills and capabilities are in demand, factors to consider before pursuing a job in industry, and potential paths for professional advancement. This presentation will focus on vignettes and examples pertaining to computational roles in the context of the wider context of drug discovery and development.

Biography: Enoch S. Huang received an AB in Molecular Biology from Princeton and a PhD in Structural Biology from Stanford, where he was a NSF Pre-doctoral Fellow with Prof. Michael Levitt. He was a Jane Coffin Childs Fellow at Washington University School of Medicine. After starting his computational biology career at Cereon Genomics, he joined Pfizer’s Cambridge laboratories in 2000. In 2001, he was appointed Head of Molecular Informatics and joined the site leadership team. In 2007 he accepted a global role as Head of Computational Sciences.

Enoch has been an Adjunct Assistant Professor of Bioinformatics at Boston University since 2001. He has served on external advisory boards for Drug Discovery Today, Brandeis University, the International Society for Computational Biology, the NIH "Illuminating the Druggable Genome" program, the Rochester Institute of Technology, and the Minnesota Supercomputing Institute. Enoch has also served on the program committees at the New York Academy of Sciences, the Massachusetts Biotechnology Council, and on NIH study sections. He has authored over 30 publications and released the Open Source software package PFAAT.

Schedule at a Glance

The schedule can also be found in the Program Booklet.

Sunday, August 20
8-10 a.m. CAME: 6th Workshop on Computational Advances in Molecular Epidemiology CNB-MAC: The Fourth International Workshop on Computational Network Biology: Modeling, Analysis, and Control CSBW: Computational Structural Biology Workshop MMM: Workshop on Microbiomics, Metagenomics, and Metabolomics Tutorial: Introducing the New eICU Collaborative Research Database Tutorial: Computational modeling of protein‐RNA interactions
10-10:30 a.m. Coffee Break
10:30-noon Tutorial: Robotics-inspired Algorithms for Modeling Protein Structures and Motions
12-1:30 p.m. Lunch on your own
1:30-3:30 p.m. Student Mentoring Workshop ParBio: Sixth Workshop on Parallel and Cloud-Based Bioinformatics and Biomedicine
3:30-4 p.m. Coffee Break
4-5:30 p.m. Tutorial: Stochastic Process Model and Its Applications to Analysis of Longitudinal Data
6-8 p.m. Student Social Event
Monday, August 21
8:45-9 a.m. Opening and Welcome Remarks
9-10 a.m. Keynote: Shawn Murphy
10-10:30 a.m. Coffee Break
10:30-noon Cancer Genomics and Inferring Phylogenies and Haplotypes

Pancancer Modelling Predicts the Context-specific Impact of Somatic Mutations on Transcriptional Programs

Beyond Perfect Phylogeny: Multisample Phylogeny Reconstruction via ILP

A Compatibility Approach to Identify Recombination Breakpoints in Sequence Alignments

Phylogenetic tree based method for uncovering co-mutational site-pairs in influenza viruses

Text Mining and Classification

Mapping Free Text into MedDRA by Natural Language Processing: a Modular Approach in Designing and Evaluating Software Extensions

Dependency Embeddings and AMR Embeddings for Drug-Drug Interaction Extraction from Biomedical Texts

Leveraging Sentiment Analysis for Classifying Patient Complaints

Identifying Harm Events in Clinical Care through Medical Narratives

Proteins and RNA Structure, Dynamics, and Analysis I

Fast, Clash-free RNA Conformational Morphing Using Molecular Junctions

GOstruct 2.0: Automated Protein Function Prediction for Annotated Proteins

Identification and Prediction of Intrinsically Disordered Regions in Proteins using n-grams

Deep recurrent conditional random field network for protein secondary prediction

12-2 p.m. Lunch on your own
2-3:30 p.m. Genomic Variation and Disease

CERENKOV: Computational Elucidation of the REgulatory NonKOding Variome

Exploring Frequented Regions in Pan-Genomic Graphs

Bayesian Hyperparameter Optimization for Machine Learning Based eQTL Analysis

Coal-Miner: a Coalescent-based Method for GWA Studies of Quantitative Traits with Complex Evolutionary Origins

Clinical Databases and Information Systems

Learning Deep Representations from Heterogeneous Patient Data for Predictive Diagnosis

Winnow: Interactive Visualization of Temporal Changes in Multidimensional Clinical Data

Associating Genomics and Clinical Information by Means of Semantic Based Ranking

Tailoring Training for Obese Individuals Through a Case-Based System

Big Data in Bioinformatics I

SparkGA: A Spark Framework for Cost Effective, Fast and Accurate DNA Analysis at Scale

Fast and Highly Scalable Bayesian MDP on a GPU Platform

Inferring Microbial Interactions from Metagenomic Time-series Using Prior Biological Knowledge

Secure Cloud Computing for Pairwise Sequence Alignment

3:30-4 p.m. Coffee Break
4-5:30 p.m. ACM SIGBio General Meeting
6-8 p.m. Poster Session & Reception
Tuesday, August 22
8:45 a.m.-9 a.m. Opening and Welcome Remarks
9-10 a.m. Keynote: Dagmar Ringe
10-10:30 a.m. Coffee Break
10:30-noon Advancing Algorithms and Methods I

Rich Chromatin Structure Prediction from Hi-C Data

Cophenetic Median Trees Under the Manhattan Distance

DeepCCI: End-to-end Deep Learning for Chemical-Chemical Interaction Prediction

Automated Diagnosis and Prediction I

A Multi-view Deep Learning Method for Epileptic Seizure Detection using Short-time Fourier transform

TUCUXI – An Intelligent System for Personalized Medicine: from Individualization of Treatments to Research Databases and Back

Interpretable Predictions of Clinical Outcomes with An Attention-based Recurrent Neural Network

Confused or not Confused? Disentangling Brain Activity from EEG Data Using Bidirectional LSTM Recurrent Neural Networks

Protein and RNA Structure, Dynamics, and Analysis I

3D Genome Structure Modeling by Lorentzian Objective Function

Folding Large Proteins by Ultra-Deep Learning

Predicting the Effect of Point Mutations on Protein Structural Stability

12-1:30 p.m. Boxed Lunch Women in Bioinformatics Panel - The Panelists - Introduction Slides
1:30-2 p.m. Keynote: Enoch Huang
2-3:30 p.m. Industry Panel - The Panelists Advancing Algorithms and Methods II

Synthesizing Species Trees from Unrooted Gene Trees: A Parameterized Approach

Fleximer: Accurate Qantification of RNA-Seq via Variable-Length k-mers

A Sparse Latent Regression Approach for Integrative Analysis of Glycomic and Glycotranscriptomic Data

Use of Structural Properties of Underlying Graphs in Pathway Enrichment Analysis of Genomic Data

Applications to Microbes and Imaging Genetics

Seq2seq Fingerprint: An Unsupervised Deep Molecular Embedding for Drug Discovery

Detection of Differential Abundance Intervals in Longitudinal Metagenomic Data Using Negative Binomial Smoothing Spline ANOVA

Cell Neighbor Determination in the Metazoan Embryo System

Predictive and Comparative Network Analysis of the Gut Microbiota in Type 2 Diabetes

3:30-4 p.m. Coffee Break
4-6:00 p.m. NSF Sponsored Student Research Forum - Abstracts Demos and Exhibits
6:30-8 p.m. Dinner Banquet
Wednesday, August 23
8:45 a.m.-9 a.m. Opening and Welcome Remarks
9-10 a.m. Keynote: Tandy Warnow
10-10:30 a.m. Coffee Break
10:30-noon Systems Biology I

Network-based Interpretation of Diverse High-throughput Datasets through the Omics Integrator Software Package

Bayesian Collective Markov Random Fields for Subcellular Localization Prediction of Human Proteins

Differential Community Detection in Paired Biological Networks

TINTIN: Exploiting Target Features for Signaling Network Similarity Computation and Ranking

Knowledge Representation Applications

Determining Optimal Features for Predicting Type IV Secretion System Effector Proteins for Coxiella burnetii

Knowledge Rich Natural Language Queries over Structured Biological Databases

Discovering Inconsistencies in PubMed abstracts through Ontology-based Information Extraction

TRuML: A Translator for Rule-Based Modeling Languages

Integrative Methods for Genomic Data

Analysis of Single Cells on a Pseudotime Scale along Postnatal Pancreatic Beta Cell Development

HEMnet: Integration of Electronic Medical Records with Molecular Interaction Networks and Domain Knowledge for Survival Analysis

A Novel Approach for Classifying Gene Expression Data using Topic Modeling

Differential Compound Prioritization via Bi-Directional Selectivity Push with Power

12-2 p.m. Lunch Buffet by conference hotel
2-3:30 p.m. Sequence Analysis and Genome Assembly

Understanding Sequence Conservation with Deep Learning

SeqyClean: a Pipeline for High-throughput Sequence Data Preprocessing

Distributed Memory Partitioning of High-Throughput Sequencing Datasets for Enabling Parallel Genomics Analyses

Scalable Genomic Assembly through Parallel de Bruijn Graph Construction for Multiple K-mers

Applications to Healthcare Processes

Drug Response Prediction as a Link Prediction Problem

Co-MEAL: Cost-Optimal Multi-expert Active Learning Architecture for Mobile Health Monitoring

Antidote Application: an Educational System for Treatment of Common Toxin Overdose

Automated Off-label Drug Use Detection from User Generated Content

Biological Modeling

Model-based Transcriptome Engineering

Circadian Rhythms in Neurospora Exhibit Biologically Relevant Driven and Damped Harmonic Oscillations

Hybrid ODE/SSA Model of the Budding Yeast Cell Cycle Control Mechanism with Mutant Case Study

RBFNN-based Modelling and Analysis for the Signal Reconstruction of Peripheral Nerve Tissue

3:30-4 p.m. Coffee Break
4-5:30 p.m. Systems Biology II

Reverse Engineering Gene Networks: A Comparative Study at Genome-scale

Unsupervised Multi-View Feature Selection for Tumor Subtype Identification

Computational Intractability Generates the Topology of Biological Networks

A Flexible and Robust Multi-Source Learning Algorithm for Drug Repositioning

Automated Diagnosis and Prediction II

Tensor-Factorization-Based Phenotyping using Group Information: Case Study on the Efficacy of Statins

Infer Cause of Death for Population Health Using Convolutional Neural Network

Automated Breast Cancer Diagnosis Using Deep Learning and Region of Interest Detection (BC-DROID)

A Physiological Thermal Regulation Model with Application to the Diagnosis of Diabetic Peripheral Neuropathy

Big Data in Bioinformatics II

An Out-of-Core GPU Based Dimensionality Reduction Algorithm for Big Mass Spectrometry Data and its Application in Bottom-up Proteomics

Building Applications for Interactive Data Exploration in Systems Biology

Mining Faces from Biomedical Literature using Deep Learning

A Correlation Network Model Utilizing Gait Parameters for Evaluating Health Levels

5:30-5:45 p.m. Closing Remarks

Program Booklet

View the Program Booklet

Important Dates
Call for Submission Deadline Notification of Acceptance
Papers April 24 June 7
Workshops March 17 March 20
Tutorials March 17 March 20
Highlights May 8 June 7
Posters June 11 June 19

News

ACM-BCB 2017 papers now available through ACM DL

August 17, 2017

ACM-BCB 2017 tutorials are now posted

August 11, 2017

Preliminary program booklet posted

June 22, 2017

Poster submission deadline extended to June 11

May 25, 2017

Registration is open

May 17, 2017

Paper submission deadline extended to April 24

April 8, 2017

ACM-BCB 2017 Call for Posters dates updated

April 8, 2017

ACM-BCB 2017 Call for Highlights is now posted

March 13, 2017

Workshop proposal deadline extended to March 17

March 10, 2017

Venue information is available

January 18, 2017

ACM-BCB 2017 Call for Papers is now posted

January 5, 2017


Sponsors