I build resource-constrained machine learning (ML) systems for science – in vision, timeseries, and text domains – fusing approaches from statistical ML and CS systems.
I am currently a final-year PhD candidate advised by John Cunningham and working closely with Liam Paninski. I am also a part-time NLP researcher at MosaicML/Databricks, under Jonathan Frankle, where I work on LLM parameter-efficient finetuning, evaluation, and data, with an emphasis on code generation.
In my primary PhD work, I develop semi-supervised computer vision systems for tracking animals in videos, reducing the amount of labeled data needed for the task and improving generalization. Our package lightning-pose (bioRxiv, 2023; under review) is widely used in science and industry. I have also tackled this problem via probabilistic representation learning (NeurIPS, 2019; PLOS Comp. Biol., 2021), 3D vision, and physical simulation (NeurIPS DiffCVGP, 2020). In a second line of work, I focus on the computational efficiency and inductive biases of Gaussian processes (ICML, 2021). My ongoing NLP work at MosaicML addresses questions of knowledge acquisition and extinction and their interaction with parameter-efficient finetuning methods (more soon!).
Throughout my PhD, I have collaborated closely with Lightning AI; I was named a Lightning Ambassador and was a featured developer in their first DevCon in June 2022.
Here is my CV.
PhD in Computational Neuroscience, 2018-
Columbia University
MA in Cognitive Science, 2018
Tel Aviv University
The Adi Lautman Interdisciplinary Program for Outstanding Students (Cog. Sci., Math, Neurobio.), 2013-2017
Tel Aviv University
Introduces a semi-supervised approach to pose estimation, using physically-informed inductive biases to improve generalization with fewer labels. Poses are further refined by combining deep ensembles with state-space models. We open-source a deep learning system optimized for efficiency, built on PyTorch Lightning and NVIDIA DALI.
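To make the idea concrete, here is a minimal sketch (not the lightning-pose implementation) of a semi-supervised objective: a supervised keypoint-regression loss on labeled frames plus a physically-informed temporal-smoothness penalty on unlabeled video. The function names and the `lam`/`eps` parameters are illustrative assumptions.

```python
import numpy as np

def supervised_loss(pred_kp, true_kp):
    """Mean squared error on keypoints from labeled frames."""
    return np.mean((pred_kp - true_kp) ** 2)

def temporal_smoothness(pred_seq, eps=1.0):
    """Physically-informed prior on unlabeled video: penalize
    implausibly large jumps between consecutive frames.
    pred_seq has shape (T, K, 2): T frames, K keypoints, (x, y)."""
    jumps = np.linalg.norm(np.diff(pred_seq, axis=0), axis=-1)  # (T-1, K)
    return np.mean(np.maximum(jumps - eps, 0.0))

def semi_supervised_loss(pred_kp, true_kp, pred_seq, lam=0.5):
    """Combine the labeled-frame loss with the unlabeled-video prior."""
    return supervised_loss(pred_kp, true_kp) + lam * temporal_smoothness(pred_seq)
```

Because the smoothness term needs only predictions, every unlabeled frame contributes training signal, which is what reduces the labeling burden.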
This model disentangles movement that can be quantified by keypoints (e.g., limb position) from subtler feature variations like orofacial movements. We introduce a novel VAE whose latent space decomposes into two orthogonal subspaces – one unsupervised subspace and one supervised subspace that is linearly predictive of the labels (keypoints). The latent space additionally includes a context variable that predicts the video/subject identity.
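A minimal sketch of the latent-space decomposition, assuming the two subspaces are represented by basis matrices kept orthogonal via a Frobenius-norm penalty (the dimensions and function names here are illustrative, not the paper's actual architecture):

```python
import numpy as np

rng = np.random.default_rng(0)

def orthogonality_penalty(U, V):
    """Squared Frobenius norm of U^T V: zero exactly when the
    two subspace bases are mutually orthogonal."""
    return np.sum((U.T @ V) ** 2)

# Toy 8-dim latent space split into a 4-dim supervised subspace
# and a 4-dim unsupervised subspace; QR gives orthonormal columns.
Q = np.linalg.qr(rng.normal(size=(8, 8)))[0]
sup_basis, unsup_basis = Q[:, :4], Q[:, 4:]

z = rng.normal(size=8)
z_sup = sup_basis.T @ z      # component linearly predictive of keypoints
z_unsup = unsup_basis.T @ z  # remaining variability (e.g., orofacial movement)
```

In training, `orthogonality_penalty(sup_basis, unsup_basis)` would be added to the VAE objective so the supervised readout and the unsupervised residual capture non-overlapping variation.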