I build resource-constrained machine learning (ML) systems for science – in vision, timeseries, and text domains – fusing approaches from statistical ML and CS systems.

I am currently a final-year PhD candidate advised by John Cunningham and working closely with Liam Paninski. I am also a part-time NLP researcher at MosaicML/Databricks, under Jonathan Frankle, where I work on LLM parameter-efficient finetuning, evaluation, and data, with an emphasis on code generation.

In my primary PhD work, I develop semi-supervised computer vision systems for tracking animals in videos, reducing the amount of labeled data needed for the task and improving generalization. Our package lightning-pose (bioRxiv, 2023; under review) is widely used in science and industry. I have also tackled this problem via probabilistic representation learning (NeurIPS, 2019, PLOS Comp. Biol., 2021), 3D vision, and physical simulation (NeurIPS DiffCVGP, 2020). In a second line of work, I focus on the computational efficiency and inductive biases of Gaussian processes (ICML, 2021). My ongoing NLP work at MosaicML addresses questions of knowledge acquisition and extinction and its interaction with parameter-efficient finetuning methods (more soon!).

Throughout my PhD, I collaborated closely with Lightning AI, named a Lightning Ambassador, and a featured developer in their first DevCon, June 2022.

Here is my CV.

Interests
  • Gaussian Processes and state-space models.
  • LLM finetuning, evaluation, data curation, and codegen.
  • Pose estimation and inverse control problems.
  • Computational Neuroscience and Neuroethology.
Education
  • PhD in Computational Neuroscience, 2018-

    Columbia University

  • MA in Cognitive Science, 2018

    Tel Aviv University

  • The Adi Lautman Interdisciplinary Program for Outstanding Students (Cog. Sci., Math, Neurobio.), 2013-2017

    Tel Aviv University

Recent Publications

Quickly discover relevant content by filtering publications.
(2023). Reproducibility of in-vivo electrophysiological measurements in mice. bioRxiv 2023 (under review).

PDF Cite

(2023). Lightning Pose: improved animal pose estimation via semi-supervised learning, Bayesian ensembling, and cloud-native open-source tools. Under review.

PDF Cite Code NVIDIA TechBlog

(2021). Partitioning variability in animal behavioral videos using semi-supervised variational autoencoders. In PLoS Comp. Biol.

PDF Cite Source Document

(2021). Bias-Free Scalable Gaussian Processes via Randomized Truncations. In ICML 2021.

PDF Cite Code Slides Video Source Document

(2020). Inverse Articulated-Body Dynamics from Video via Variational Sequential Monte Carlo. In NeurIPS DiffCVGP 2020 (Oral).

PDF Cite Video

(2019). BehaveNet: nonlinear embedding and Bayesian neural decoding of behavioral videos. In NeurIPS 2019.

PDF Cite Source Document

(2017). Contingent Capture Is Weakened in Search for Multiple Features From Different Dimensions. In JEP HPP.

PDF Cite Source Document