Aaron Gokaslan

I am a 4th year PhD Candidate at Cornell University advised by Volodymyr Kuleshov. I also work closely with James Grimmelmann, Noah Snavely, and Sasha Rush. Previously, I worked at Facebook AI Research advised by Dhruv Batra. Before that, I did my masters and undergrad at Brown University with James Tompkin.

My research focuses identifying, designing, and building efficient, scalable, sustainable, and affordable abstractions and infrastructure for generative modeling research. I also do work at the intersection of data, law, and AI policy.

My work has been recognized by orals and invited talks at top conference including NeurIPS, ECCV, ICML, and CVPR. I am a Mozilla RISE25 2024 honoree, and I have received awards for my open source contributions from the Linux Foundation and Mozilla.

I maintain pybind11, PyTorch, and other popular open source libraries. I released one of the first 1 billion parameter+ auto-regression large language models OpenGPT2. I have contributed to the development of popular open source generative AI artifacts including OpenWebText, OpenGPT2, CommonCanvas, CommonCatalog, CommonPile, Habitat-Matterport3D, DataComp-LM, and BLOOM. These works have been collectively downloaded millions of times. I also helped create the Responsible AI License (RAIL) as a co-chair of the BLOOM Workshop. Additionally, I serve on the advisory board of EncodeJustice and Fidutam.

My work has been featured by WIRED, CNN, TechCrunch, and others.

I am currently on the academic and industry job market. I will also be attending NeurIPS 2024 in Vancouver. Please see my Google Scholar for my most up to date publication list.

Email  /  CV  /  Google Scholar  /  Twitter  /  Github

profile photo
Mozilla Rise25 Award Video - 3 Min Intro.
Research
MeMDLM: De Novo Membrane Protein Design with Masked Discrete Diffusion Protein Language Models.
Shrey Goel, Vishrut Thoutam, Edgar Mariano Marroquin, Aaron Gokaslan, Arash Firouzbakht, Sophia Vincoff, Volodymyr Kuleshov, Huong T. Kratochvil, Pranam Chatterjee
Arxiv, 2024
Self-Directed Synthetic Dialogues and Revisions Technical Report.
Nathan Lambert, Hailey Schoelkopf, Aaron Gokaslan, Luca Soldaini, Valentina Pyatkin, Louis Castricato
Arxiv, 2024
DataComp-LM: In Search of the Next Generation of Training Sets for Language Models.
Jeffrey Li, Alex Fang, Georgios Smyrnis, Maor Ivgi, Matt Jordan, Samir Gadre, Hritik Bansal, Etash Guha, Aaron Gokaslan, et al.
NeurIPS, 2024
Vid3D: Synthesis of Dynamic 3D Scenes using 2D Video Diffusion.
Rishab Parthasarathy, Zachary Ankner, Aaron Gokaslan
Arxiv, 2024
Simple and Effective Masked Diffusion Language Models.
Subham Sekhar Sahoo, Marianne Arriola, Yair Schiff, Aaron Gokaslan, Edgar Marroquin, Justin T Chiu, Alexander Rush, Volodymyr Kuleshov
NeurIPS, 2024
Oral at BioML Workshop ICML 2024
Diffusion Models With Learned Adaptive Noise.
Subham Sekhar Sahoo, Aaron Gokaslan, Chris De Sa, Volodymyr Kuleshov
NeurIPS, 2024
Spotlight: Top 3% of papers of 15,671 submissions
Cross-species Modeling of Plant Genomes at Single Nucleotide Resolution Using a Pre-trained DNA Language Model.
Jingjing Zhai, Aaron Gokaslan, Yair Schiff, Ana Berthel, Zong-Yan Liu, Zachary R. Miller, Armin Scheben, Michelle C. Stitzer, M. Cinta Romay, Edward S. Buckler, Volodymyr Kuleshov
Biorxiv, 2024
Caduceus: Bi-Directional Equivariant Long-Range DNA Sequence Modeling.
Yair Schiff, Chia-Hsiang Kao, Aaron Gokaslan, Tri Dao, Albert Gu, Volodymyr Kuleshov
Arxiv, 2024
On the Standardization of Behavioral Use Clauses and Their Adoption for Responsible Licensing of AI.
Daniel McDuff, Tim Korjakow, Scott Cambo, Jesse Josua Benjamin, Jenny Lee, Yacine Jernite, Carlos Muñoz Ferrandis, Aaron Gokaslan, Alek Tarkowski, Joseph Lindley, A Feder Cooper, Danish Contractor
Arxiv, 2024
Advancing DNA Language Models: The Genomics Long-Range Benchmark.
Chia Hsiang Kao, Evan Trop, McKinley Polen, Yair Schiff, Bernardo P de Almeida, Aaron Gokaslan, Thomas Pierrot, Volodymyr Kuleshov
ICML SPIGM Workshop, 2024
Oral Presentation
CommonCanvas: An Open Diffusion Model Trained with Creative-Commons Images.
Aaron Gokaslan, A. Feder Cooper, Jasmine Collins, Landan Seguin, Austin Jacobson, Mihir Patel, Jonathan Frankle, Cory Stephenson, Volodymyr Kuleshov
CVPR, 2024
Accepted to CVPR 2024
Presented at NeurIPS 2023 Diffusion and Content Creativity Workshops
InfoDiffusion: Representation Learning Using Information Maximizing Diffusion Models.
Yingheng Wang, Yair Schiff, Aaron Gokaslan, Weishen Pan, Fei Wang, Christopher De Sa, Volodymyr Kuleshov
ICML, 2023
Galactic: Scaling End-to-End Reinforcement Learning for Rearrangement at 100k Steps-per-Second.
Vincent-Pierre Berges, Andrew Szot, Devendra Singh Chaplot, Aaron Gokaslan, Roozbeh Mottaghi, Dhruv Batra, Eric Undersander
CVPR, 2023
The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset.
Hugo Laurençon, Lucile Saulnier, Thomas Wang, Christopher Akiki, Albert Villanova del Moral, Teven Le Scao, Leandro Von Werra, Chenghao Mou, Eduardo Gonzålez Ponferrada, Huu Nguyen, Jörg Frohberg, Mario Ơaƥko, Quentin Lhoest, Angelina McMillan-Major, Gérard Dupont, Stella Biderman, Anna Rogers, Loubna Ben Allal, Francesco De Toni, Giada Pistilli, Olivier Nguyen, Somaieh Nikpoor, Maraim Masoud, Pierre Colombo, Javier de la Rosa, Paulo Villegas, Tristan Thrush, Shayne Longpre, Sebastian Nagel, Leon Weber, Manuel Romero Muñoz, Jian Zhu, Daniel Van Strien, Zaid Alyafeai, Khalid Almubarak, Vu Minh Chien, Itziar Gonzalez-Dios, Aitor Soroa, Kyle Lo, Manan Dey, Pedro Ortiz Suarez, Aaron Gokaslan, Shamik Bose, David Ifeoluwa Adelani, Long Phan, Hieu Tran, Ian Yu, Suhas Pai, Jenny Chim, Violette Lepercq, Suzana Ilic, Margaret Mitchell, Sasha Luccioni, Yacine Jernite
NeurIPS, 2022
Featured Paper: Oral Presentation (≈ 1% acceptance rate)
Data Governance in the Age of Large-Scale Data-Driven Language Technology.
Yacine Jernite, Huu Nguyen, Stella Biderman, Anna Rogers, Maraim Masoud, Valentin Danchev, Samson Tan, Alexandra Sasha Luccioni, Nishant Subramani, Isaac Johnson, Gerard Dupont, Jesse Dodge, Kyle Lo, Zeerak Talat, Dragomir R. Radev, Aaron Gokaslan, Somaieh Nikpoor, Peter Henderson, Rishi Bommasani, Margaret Mitchell
FAccT, 2022
Habitat-Matterport 3D Semantics Dataset.
Karmesh Yadav, Ram Ramrakhya, Santhosh Kumar Ramakrishnan, Théophile Gervet, John Turner, Aaron Gokaslan, Noah Maestre, Angel Xuan Chang, Dhruv Batra, Manolis Savva, Alexander William Clegg, Devendra Singh Chaplot
arXiv, 2022
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model.
Teven Le Scao, Angela Fan, Christopher Akiki, Ellie Pavlick, Suzana Ilic, Daniel Hesslow, Roman Castagné, Alexandra Sasha Luccioni, François Yvon, Matthias Gallé, Jonathan Tow, Alexander M. Rush, Stella Biderman, Albert Webson, Pawan Sasanka Ammanamanchi, Thomas Wang, Benoßt Sagot, Niklas Muennighoff, Albert Villanova del Moral, Olatunji Ruwase, Rachel Bawden, Stas Bekman, Angelina McMillan-Major, Iz Beltagy, Huu Nguyen, Lucile Saulnier, Samson Tan, Pedro Ortiz Suarez, Victor Sanh, Hugo Laurençon, Yacine Jernite, Julien Launay, Margaret Mitchell, Colin Raffel, Aaron Gokaslan, Adi Simhi, Aitor Soroa, Alham Fikri Aji, Amit Alfassy, Anna Rogers, Ariel Kreisberg Nitzav, Canwen Xu, Chenghao Mou, Chris Emezue, Christopher Klamm, Colin Leong, Daniel van Strien, David Ifeoluwa Adelani, et al.
arXiv, 2022
GaussiGAN: Controllable Image Synthesis with 3D Gaussians from Unposed Silhouettes.
Youssef Alami Mejjati, Isa Milefchik, Aaron Gokaslan, Oliver Wang, Kwang In Kim, James Tompkin
BMVC, 2021
Waypoint Models for Instruction-guided Navigation in Continuous Environments.
Jacob Krantz, Aaron Gokaslan, Dhruv Batra, Stefan Lee, Oleksandr Maksymets
ICCV, 2021
Oral Presentation: Top (3%/210) of all (6236) submission
THDA: Treasure Hunt Data Augmentation for Semantic Navigation.
Oleksandr Maksymets, Vincent Cartillier, Aaron Gokaslan, Erik Wijmans, Wojciech Galuba, Stefan Lee, Dhruv Batra
ICCV, 2021
TöRF: Time-of-Flight Radiance Fields for Dynamic Scene View Synthesis.
Benjamin Attal, Eliot Laidlaw, Aaron Gokaslan, Changil Kim, Christian Richardt, James Tompkin, Matthew O'Toole
NeurIPS, 2021
Habitat-Matterport 3D Dataset (HM3D): 1000 Large-scale 3D Environments for Embodied AI.
Santhosh Kumar Ramakrishnan, Aaron Gokaslan, Erik Wijmans, Oleksandr Maksymets, Alexander Clegg, John Turner, Eric Undersander, Wojciech Galuba, Andrew Westbury, Angel X. Chang, Manolis Savva, Yili Zhao, Dhruv Batra
NeurIPS Datasets and Benchmarks, 2021
Habitat 2.0: Training Home Assistants to Rearrange their Habitat.
Andrew Szot, Alexander Clegg, Eric Undersander, Erik Wijmans, Yili Zhao, John Turner, Noah Maestre, Mustafa Mukadam, Devendra Singh Chaplot, Oleksandr Maksymets, Aaron Gokaslan, Vladimir Vondrus, Sameer Dharur, Franziska Meier, Wojciech Galuba, Angel X. Chang, Zsolt Kira, Vladlen Koltun, Jitendra Malik, Manolis Savva, Dhruv Batra
NeurIPS, 2021
Spotlight: Top 3% of papers
OpenGPT-2: open language models and implications of generated text.
Vanya Cohen, Aaron Gokaslan
XRDS, 2020
Sim2Real Predictivity: Does Evaluation in Simulation Predict Real-World Performance?
Abhishek Kadian, Joanne Truong, Aaron Gokaslan, Alexander Clegg, Erik Wijmans, Stefan Lee, Manolis Savva, Sonia Chernova, Dhruv Batra
IEEE Robotics Autom. Lett., 2020 and International Conference on Intelligent Robots and Systems, 2020
MatryODShka: Real-time 6DoF Video View Synthesis Using Multi-sphere Images.
Benjamin Attal, Selena Ling, Aaron Gokaslan, Christian Richardt, James Tompkin
ECCV, 2020
Oral Presentation: Top 2% out of 5025 submissions
Generating Object Stamps.
Youssef Alami Mejjati, Zejiang Shen, Michael Snower, Aaron Gokaslan, Oliver Wang, James Tompkin, Kwang In Kim
arXiv, 2020
ObjectNav Revisited: On Evaluation of Embodied Agents Navigating to Objects.
Dhruv Batra, Aaron Gokaslan, Aniruddha Kembhavi, Oleksandr Maksymets, Roozbeh Mottaghi, Manolis Savva, Alexander Toshev, Erik Wijmans
arXiv, 2020
Learning Deep Parameterized Skills from Demonstration for Re-targetable Visuomotor Control.
Jonathan Chang, Nishanth Kumar, Sean Hastings, Aaron Gokaslan, Diego Romeres, Devesh K. Jha, Daniel Nikovski, George Dimitri Konidaris, Stefanie Tellex
arXiv, 2019
b3do GANimorph: Improving Shape Deformation in Unsupervised Image to Image Translation
Aaron Gokaslan, Vivek Ramanujan, Daniel Ritchie, Kwang In Kim, and James Tompkin
ECCV, 2018
code
The eye of the typer: a benchmark and analysis of gaze behavior during typing.
Alexandra Papoutsaki, Aaron Gokaslan, James Tompkin, Yuze He, Jeff Huang
ETRA, 2018

This website is based on Jon Barron's source code,