Aaron Gokaslan

My research focuses on identifying, designing, and building efficient, scalable, sustainable, and affordable abstractions and infrastructure for generative modeling. I also work at the intersection of data, law, and AI policy. I helped create some of the first embodied AI robotics foundation models and frontier-level open-source language models, contributed to datasets including OpenWebText, LAION, BLOOM, OpenThoughts, DCLM, and The Pile, and co-authored the RAIL AI License, the second most widely used AI software license on Hugging Face.

My research focuses identifying, designing, and building efficient, scalable, sustainable, and affordable abstractions and infrastructure for generative modeling research. I also do work at the intersection of data, law, and AI policy. I created some of the first embodied AI robotics foundation models, frontier level open source larguage language models, contributed to datasets including OpenWebText, LAION, BLOOM, OpenThoughs, DCLM,and the Pile, and co-authored the RAIL AI License, the second most popular AI software license on Huggingface

My work has been recognized by orals and invited talks at top conference including NeurIPS, ECCV, ICML, and CVPR. I am a Mozilla RISE25 2024 honoree, and I have received awards for my open source contributions from the Linux Foundation and Mozilla.

I maintain pybind11, PyTorch, and other popular open source libraries. I released one of the first 1 billion parameter+ auto-regression large language models OpenGPT2. I have contributed to the development of popular open source generative AI artifacts including OpenWebText, OpenGPT2, CommonCanvas, CommonCatalog, CommonPile, Habitat-Matterport3D, DataComp-LM, and BLOOM. These works have been collectively downloaded millions of times. I also helped create the Responsible AI License (RAIL) as a co-chair of the BLOOM Workshop. Additionally, I serve on the advisory board of EncodeJustice and Fidutam.

My work has been featured by WIRED, CNN, TechCrunch, and others.

I am currently on the academic and industry job market. I will also be attending NeurIPS 2024 in Vancouver. Please see my Google Scholar for my most up to date publication list.

Email / CV / Google Scholar / Twitter / Github

Mozilla Rise25 Award Video - 3 Min Intro.

Research


	MeMDLM: De Novo Membrane Protein Design with Masked Discrete Diffusion Protein Language Models. Shrey Goel, Vishrut Thoutam, Edgar Mariano Marroquin, Aaron Gokaslan, Arash Firouzbakht, Sophia Vincoff, Volodymyr Kuleshov, Huong T. Kratochvil, Pranam Chatterjee Arxiv, 2024
	Self-Directed Synthetic Dialogues and Revisions Technical Report. Nathan Lambert, Hailey Schoelkopf, Aaron Gokaslan, Luca Soldaini, Valentina Pyatkin, Louis Castricato Arxiv, 2024
	DataComp-LM: In Search of the Next Generation of Training Sets for Language Models. Jeffrey Li, Alex Fang, Georgios Smyrnis, Maor Ivgi, Matt Jordan, Samir Gadre, Hritik Bansal, Etash Guha, Aaron Gokaslan, et al. NeurIPS, 2024
	Vid3D: Synthesis of Dynamic 3D Scenes using 2D Video Diffusion. Rishab Parthasarathy, Zachary Ankner, Aaron Gokaslan Arxiv, 2024
	Simple and Effective Masked Diffusion Language Models. Subham Sekhar Sahoo, Marianne Arriola, Yair Schiff, Aaron Gokaslan, Edgar Marroquin, Justin T Chiu, Alexander Rush, Volodymyr Kuleshov NeurIPS, 2024 Oral at BioML Workshop ICML 2024
	Diffusion Models With Learned Adaptive Noise. Subham Sekhar Sahoo, Aaron Gokaslan, Chris De Sa, Volodymyr Kuleshov NeurIPS, 2024 Spotlight: Top 3% of papers of 15,671 submissions
	Cross-species Modeling of Plant Genomes at Single Nucleotide Resolution Using a Pre-trained DNA Language Model. Jingjing Zhai, Aaron Gokaslan, Yair Schiff, Ana Berthel, Zong-Yan Liu, Zachary R. Miller, Armin Scheben, Michelle C. Stitzer, M. Cinta Romay, Edward S. Buckler, Volodymyr Kuleshov Biorxiv, 2024
	Caduceus: Bi-Directional Equivariant Long-Range DNA Sequence Modeling. Yair Schiff, Chia-Hsiang Kao, Aaron Gokaslan, Tri Dao, Albert Gu, Volodymyr Kuleshov Arxiv, 2024
	On the Standardization of Behavioral Use Clauses and Their Adoption for Responsible Licensing of AI. Daniel McDuff, Tim Korjakow, Scott Cambo, Jesse Josua Benjamin, Jenny Lee, Yacine Jernite, Carlos Muñoz Ferrandis, Aaron Gokaslan, Alek Tarkowski, Joseph Lindley, A Feder Cooper, Danish Contractor Arxiv, 2024
	Advancing DNA Language Models: The Genomics Long-Range Benchmark. Chia Hsiang Kao, Evan Trop, McKinley Polen, Yair Schiff, Bernardo P de Almeida, Aaron Gokaslan, Thomas Pierrot, Volodymyr Kuleshov ICML SPIGM Workshop, 2024 Oral Presentation
	CommonCanvas: An Open Diffusion Model Trained with Creative-Commons Images. Aaron Gokaslan, A. Feder Cooper, Jasmine Collins, Landan Seguin, Austin Jacobson, Mihir Patel, Jonathan Frankle, Cory Stephenson, Volodymyr Kuleshov CVPR, 2024 Accepted to CVPR 2024 Presented at NeurIPS 2023 Diffusion and Content Creativity Workshops
	InfoDiffusion: Representation Learning Using Information Maximizing Diffusion Models. Yingheng Wang, Yair Schiff, Aaron Gokaslan, Weishen Pan, Fei Wang, Christopher De Sa, Volodymyr Kuleshov ICML, 2023
	Galactic: Scaling End-to-End Reinforcement Learning for Rearrangement at 100k Steps-per-Second. Vincent-Pierre Berges, Andrew Szot, Devendra Singh Chaplot, Aaron Gokaslan, Roozbeh Mottaghi, Dhruv Batra, Eric Undersander CVPR, 2023
	The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset. Hugo Laurençon, Lucile Saulnier, Thomas Wang, Christopher Akiki, Albert Villanova del Moral, Teven Le Scao, Leandro Von Werra, Chenghao Mou, Eduardo González Ponferrada, Huu Nguyen, Jörg Frohberg, Mario Šaško, Quentin Lhoest, Angelina McMillan-Major, Gérard Dupont, Stella Biderman, Anna Rogers, Loubna Ben Allal, Francesco De Toni, Giada Pistilli, Olivier Nguyen, Somaieh Nikpoor, Maraim Masoud, Pierre Colombo, Javier de la Rosa, Paulo Villegas, Tristan Thrush, Shayne Longpre, Sebastian Nagel, Leon Weber, Manuel Romero Muñoz, Jian Zhu, Daniel Van Strien, Zaid Alyafeai, Khalid Almubarak, Vu Minh Chien, Itziar Gonzalez-Dios, Aitor Soroa, Kyle Lo, Manan Dey, Pedro Ortiz Suarez, Aaron Gokaslan, Shamik Bose, David Ifeoluwa Adelani, Long Phan, Hieu Tran, Ian Yu, Suhas Pai, Jenny Chim, Violette Lepercq, Suzana Ilic, Margaret Mitchell, Sasha Luccioni, Yacine Jernite NeurIPS, 2022 Featured Paper: Oral Presentation (≈ 1% acceptance rate)
	Data Governance in the Age of Large-Scale Data-Driven Language Technology. Yacine Jernite, Huu Nguyen, Stella Biderman, Anna Rogers, Maraim Masoud, Valentin Danchev, Samson Tan, Alexandra Sasha Luccioni, Nishant Subramani, Isaac Johnson, Gerard Dupont, Jesse Dodge, Kyle Lo, Zeerak Talat, Dragomir R. Radev, Aaron Gokaslan, Somaieh Nikpoor, Peter Henderson, Rishi Bommasani, Margaret Mitchell FAccT, 2022
	Habitat-Matterport 3D Semantics Dataset. Karmesh Yadav, Ram Ramrakhya, Santhosh Kumar Ramakrishnan, Théophile Gervet, John Turner, Aaron Gokaslan, Noah Maestre, Angel Xuan Chang, Dhruv Batra, Manolis Savva, Alexander William Clegg, Devendra Singh Chaplot arXiv, 2022
	BLOOM: A 176B-Parameter Open-Access Multilingual Language Model. Teven Le Scao, Angela Fan, Christopher Akiki, Ellie Pavlick, Suzana Ilic, Daniel Hesslow, Roman Castagné, Alexandra Sasha Luccioni, François Yvon, Matthias Gallé, Jonathan Tow, Alexander M. Rush, Stella Biderman, Albert Webson, Pawan Sasanka Ammanamanchi, Thomas Wang, Benoît Sagot, Niklas Muennighoff, Albert Villanova del Moral, Olatunji Ruwase, Rachel Bawden, Stas Bekman, Angelina McMillan-Major, Iz Beltagy, Huu Nguyen, Lucile Saulnier, Samson Tan, Pedro Ortiz Suarez, Victor Sanh, Hugo Laurençon, Yacine Jernite, Julien Launay, Margaret Mitchell, Colin Raffel, Aaron Gokaslan, Adi Simhi, Aitor Soroa, Alham Fikri Aji, Amit Alfassy, Anna Rogers, Ariel Kreisberg Nitzav, Canwen Xu, Chenghao Mou, Chris Emezue, Christopher Klamm, Colin Leong, Daniel van Strien, David Ifeoluwa Adelani, et al. arXiv, 2022
	GaussiGAN: Controllable Image Synthesis with 3D Gaussians from Unposed Silhouettes. Youssef Alami Mejjati, Isa Milefchik, Aaron Gokaslan, Oliver Wang, Kwang In Kim, James Tompkin BMVC, 2021
	Waypoint Models for Instruction-guided Navigation in Continuous Environments. Jacob Krantz, Aaron Gokaslan, Dhruv Batra, Stefan Lee, Oleksandr Maksymets ICCV, 2021 Oral Presentation: Top (3%/210) of all (6236) submission
	THDA: Treasure Hunt Data Augmentation for Semantic Navigation. Oleksandr Maksymets, Vincent Cartillier, Aaron Gokaslan, Erik Wijmans, Wojciech Galuba, Stefan Lee, Dhruv Batra ICCV, 2021
	TöRF: Time-of-Flight Radiance Fields for Dynamic Scene View Synthesis. Benjamin Attal, Eliot Laidlaw, Aaron Gokaslan, Changil Kim, Christian Richardt, James Tompkin, Matthew O'Toole NeurIPS, 2021
	Habitat-Matterport 3D Dataset (HM3D): 1000 Large-scale 3D Environments for Embodied AI. Santhosh Kumar Ramakrishnan, Aaron Gokaslan, Erik Wijmans, Oleksandr Maksymets, Alexander Clegg, John Turner, Eric Undersander, Wojciech Galuba, Andrew Westbury, Angel X. Chang, Manolis Savva, Yili Zhao, Dhruv Batra NeurIPS Datasets and Benchmarks, 2021
	Habitat 2.0: Training Home Assistants to Rearrange their Habitat. Andrew Szot, Alexander Clegg, Eric Undersander, Erik Wijmans, Yili Zhao, John Turner, Noah Maestre, Mustafa Mukadam, Devendra Singh Chaplot, Oleksandr Maksymets, Aaron Gokaslan, Vladimir Vondrus, Sameer Dharur, Franziska Meier, Wojciech Galuba, Angel X. Chang, Zsolt Kira, Vladlen Koltun, Jitendra Malik, Manolis Savva, Dhruv Batra NeurIPS, 2021 Spotlight: Top 3% of papers
	OpenGPT-2: open language models and implications of generated text. Vanya Cohen, Aaron Gokaslan XRDS, 2020
	Sim2Real Predictivity: Does Evaluation in Simulation Predict Real-World Performance? Abhishek Kadian, Joanne Truong, Aaron Gokaslan, Alexander Clegg, Erik Wijmans, Stefan Lee, Manolis Savva, Sonia Chernova, Dhruv Batra IEEE Robotics Autom. Lett., 2020 and International Conference on Intelligent Robots and Systems, 2020
	MatryODShka: Real-time 6DoF Video View Synthesis Using Multi-sphere Images. Benjamin Attal, Selena Ling, Aaron Gokaslan, Christian Richardt, James Tompkin ECCV, 2020 Oral Presentation: Top 2% out of 5025 submissions
	Generating Object Stamps. Youssef Alami Mejjati, Zejiang Shen, Michael Snower, Aaron Gokaslan, Oliver Wang, James Tompkin, Kwang In Kim arXiv, 2020
	ObjectNav Revisited: On Evaluation of Embodied Agents Navigating to Objects. Dhruv Batra, Aaron Gokaslan, Aniruddha Kembhavi, Oleksandr Maksymets, Roozbeh Mottaghi, Manolis Savva, Alexander Toshev, Erik Wijmans arXiv, 2020
	Learning Deep Parameterized Skills from Demonstration for Re-targetable Visuomotor Control. Jonathan Chang, Nishanth Kumar, Sean Hastings, Aaron Gokaslan, Diego Romeres, Devesh K. Jha, Daniel Nikovski, George Dimitri Konidaris, Stefanie Tellex arXiv, 2019
	GANimorph: Improving Shape Deformation in Unsupervised Image to Image Translation Aaron Gokaslan, Vivek Ramanujan, Daniel Ritchie, Kwang In Kim, and James Tompkin ECCV, 2018 code
	The eye of the typer: a benchmark and analysis of gaze behavior during typing. Alexandra Papoutsaki, Aaron Gokaslan, James Tompkin, Yuze He, Jeff Huang ETRA, 2018

This website is based on Jon Barron's source code,