Standard physics modeling approaches rely on symmetries and conservation laws to determine the relevant degrees of freedom and their dynamical rules. This method is challenged in biological physics where, for example, neither proteins nor genes can be identified through symmetries, but nonetheless play a significant role in an organism’s behavior. Instead, across biology, data-driven methods are now being used to unravel the physical laws underlying complex living systems. These approaches often take the form of inverse problems, aiming to infer underlying stochastic equations of motion or other statistical mechanics descriptions.
Examples of systems where these approaches are applied range from chromosome organization and interacting proteins to cell motility, and from postural dynamics in animals, to the movement of interacting swarms of fish, insects or birds. These inverse problems are notoriously hard, posing exciting challenges for theory. Such data-driven theory can now capitalize on the recent surge in high-quality data coming from rapid advances in quantitative experiments across biology. An example is given by the explosion of available genome sequences, allowing data-driven modeling of biological sequence data. Other example is in the broad area of morphogenesis where a complex interplay of genes, protein and biomechanical fields can be elucidated with neural networks thanks to rapid advances in imaging and optogenetics techniques.
Modern statistical inference and machine learning (ML) methods can extract generalizable rules from such large datasets and learn to approximate existing underlying functional relationships. For instance, in the case of protein structure determination, pairwise maximum entropy models brought substantial advances, and deep learning models based on natural language processing methods contributed to recent breakthroughs.
This four-hour tutorial will provide an introduction to state-of-the-art machine learning and model inference techniques and include interactive sessions of applying these methods to real biophysical data.
- Machine learning techniques including neural networks and deep learning
- Data Analysis, statistics, clustering, classification, regression
- Maximum entropy methods, model selection and inference
- Practical applications with analysis of real biophysical data Instructors Chase Broedersz, Vrije Universiteit Amsterdam Pankaj Mehta, Boston University Vincenzo Vitelli, University of Chicago Anne-Florence Bitbol, EPFL
- Chase Broedersz, Vrije Universiteit Amsterdam
- Steve Presse, Arizona State University
- Vincenzo Vitelli, University of Chicago
- Anne-Florence Bitbol, EPFL
Who Can Attend?
Practicing biological physicists at all career stages, especially students and early career researchers. The tutorial talks will be very pedagogical, describing the theoretical foundations and tools of the field as well as applications. Latest developments and open questions will also be prominently featured.