Building generally intelligent robots that just work everywhere, out of the box.
Hello, I am Mahi!
I am currently a postdoctoral researcher with Jitendra Malik, spending time at Berkeley AI Research (BAIR). Previously, I finished my Ph.D. at NYU Courant with Lerrel Pinto, with a thesis titled "Towards Generally Intelligent Robots that Simply Work Everywhere".
I study the emergence of generally intelligent behavior in robots in the wild. I work on questions at the intersection of machine learning and robotics, primarily to understand the representation, data, and memory structures that will lead to, among other things, home robots that can perform everyday tasks by learning both from humans and on their own.
I am grateful to the Apple Scholars in AI/ML PhD fellowship for supporting my research. Previously, I was a visiting scientist at Fundamental AI Research (FAIR) at Meta with Ishan Misra.
If you're interested, you can find my academic CV here.
Right now, my research focuses on figuring out how to get robots to work with us in household settings. If you are not a robotics person, think of it this way: I want to build a robot that I can tell to walk into an arbitrary home and cook me khichuri, and it will just cook it for me.
In a previous life, I was at MIT, working on Robust Machine Learning with Prof. Aleksander Madry, where I got my Master's and undergraduate degrees, and was a silver medalist for Bangladesh at the International Mathematical Olympiad.
Besides my graduate research, I keep busy with a variety of other things. In 2020 alone, I helped Bangladesh's National Data Analytics Task Force with COVID-19 data analytics and helped a university transition to online learning. Since then, I've also helped out a friend's startup with some quantitative analytics. In my free time, I like reading books, cooking, and visiting the large trove of museums in New York.
My research focuses on understanding the interplay of representation, data, and memory in robotics.
Here, you can see a few of my recent research projects. You can find a full list of my works on Google Scholar.
Filter by topic:
Policy learning representations
Scaling data for generalization
Semantic robot memory
Deploying robots at scale demands robustness to the long tail of everyday situations. We introduce MolmoSpaces, a fully open ecosystem to support large-scale benchmarking of robot policies. MolmoSpaces consists of over 230,000 diverse indoor environments populated with 130,000 richly annotated object assets, including 48,000 manipulable objects with 42 million stable grasps. The ecosystem supports the full spectrum of embodied tasks: static and mobile manipulation, navigation, and multi-room long-horizon tasks. Our experiments show MolmoSpaces-Bench exhibits strong sim-to-real correlation (R = 0.96, ρ = 0.98), providing a foundation for scalable data generation, policy training, and benchmark creation for robot learning research.
Language-conditioned robot learning has emerged as the dominant paradigm for creating general-purpose robot policies. However, language as an interface for robot control has inherent limitations: it is abstract, ambiguous, and lacks the spatial precision needed for manipulation tasks. We propose Contact-Anchored Policies (CAPs), which replace language conditioning with physical contact points in space. Rather than training a single generalist policy, CAPs organize capabilities as modular utility models, each specialized for a specific task. We also introduce EgoGym, a lightweight simulation environment for identifying and addressing failure modes before real-world deployment. Using only 23 hours of demonstration data, our method generalizes to novel environments and embodiments, outperforming large, state-of-the-art VLAs in zero-shot evaluations by 56%. All resources, including models, code, hardware specifications, simulation environments, and datasets, are publicly available.
Significant progress has been made in open-vocabulary mobile manipulation, where the goal is for a robot to perform tasks in any environment given a natural language description. However, most current systems assume a static environment, which limits the system’s applicability in real-world scenarios where environments frequently change due to human intervention or the robot’s own actions. In this work, we present DynaMem, a new approach to open-world mobile manipulation that uses a dynamic spatio-semantic memory to represent a robot’s environment. DynaMem constructs a 3D data structure to maintain a dynamic memory of point clouds, and answers open-vocabulary object localization queries using multimodal LLMs or open-vocabulary features generated by state-of-the-art vision-language models. Powered by DynaMem, our robots can explore novel environments, search for objects not found in memory, and continuously update the memory as objects move, appear, or disappear in the scene. We run extensive experiments on the Stretch SE3 robots in three real and nine offline scenes, and achieve an average pick-and-drop success rate of 70% on non-stationary objects, which is more than a 2x improvement over state-of-the-art static systems.
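To make the idea concrete, here is a toy sketch of a dynamic spatio-semantic memory in the spirit of DynaMem: a voxel map from 3D points to semantic features that supports adding, removing, and open-vocabulary querying. The class and method names are my own illustration, not the paper's code, and it assumes some CLIP-like model supplies unit-norm features for observed points and for text queries.

```python
import numpy as np

class VoxelMemory:
    """Maps voxelized 3D points to semantic features, with dynamic updates."""

    def __init__(self, voxel_size=0.05):
        self.voxel_size = voxel_size
        self.features = {}   # voxel index (tuple of ints) -> mean feature
        self.counts = {}

    def _key(self, point):
        return tuple(np.floor(np.asarray(point) / self.voxel_size).astype(int))

    def add(self, point, feature):
        """Insert or refresh a point's semantic feature (running mean)."""
        k = self._key(point)
        n = self.counts.get(k, 0)
        old = self.features.get(k, np.zeros_like(feature))
        self.features[k] = (old * n + feature) / (n + 1)
        self.counts[k] = n + 1

    def remove(self, point):
        """Forget a voxel, e.g. when new frames show the object is gone."""
        k = self._key(point)
        self.features.pop(k, None)
        self.counts.pop(k, None)

    def query(self, text_feature):
        """Return the voxel center best matching a text embedding."""
        keys = list(self.features)
        feats = np.stack([self.features[k] for k in keys])
        scores = feats @ text_feature        # cosine similarity if unit-norm
        best = keys[int(np.argmax(scores))]
        return (np.array(best) + 0.5) * self.voxel_size, float(scores.max())
```

The point of the `remove` path is exactly what the abstract emphasizes: unlike a static map, the memory stays consistent as objects move, appear, or disappear.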
Robot models, particularly those trained with large amounts of data, have recently shown a plethora of real-world manipulation and navigation capabilities. Several independent efforts have shown that, given sufficient training data in an environment, robot policies can generalize to demonstrated variations in that environment. However, needing to fine-tune robot models to every new environment stands in stark contrast to models in language or vision that can be deployed zero-shot for open-world problems. In this work, we present Robot Utility Models (RUMs), a framework for training and deploying zero-shot robot policies that can directly generalize to new environments without any fine-tuning. To create RUMs efficiently, we develop new tools to quickly collect data for mobile manipulation tasks, integrate such data into a policy with multi-modal imitation learning, and deploy policies on-device on Hello Robot Stretch, a cheap commodity robot, with an external mLLM verifier for retrying. We train five such utility models for opening cabinet doors, opening drawers, picking up napkins, picking up paper bags, and reorienting fallen objects. Our system, on average, achieves a 90% success rate in unseen, novel environments interacting with unseen objects. Moreover, the utility models can also succeed in different robot and camera setups with no further data, training, or fine-tuning. Chief among our lessons are the importance of training data over training algorithm and policy class, guidance on data scaling, the necessity of diverse yet high-quality demonstrations, and a recipe for robot introspection and retrying to improve performance in individual environments.
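The introspect-and-retry recipe is simple enough to sketch. Below is my schematic of that loop, not the paper's actual code: `policy`, `env`, and `verify` are placeholders for a trained utility model, a robot interface, and an external mLLM asked to judge success from the final camera frame.

```python
def run_with_retries(policy, env, verify, max_attempts=3):
    """policy: obs -> action; env: robot interface with reset/step/last_image;
    verify: image -> bool, an mLLM asked e.g. "is the drawer open?"."""
    for _ in range(max_attempts):
        obs, done = env.reset(), False          # return to a known start pose
        while not done:
            obs, done = env.step(policy(obs))   # run the utility model
        if verify(env.last_image()):            # introspect on the final frame
            return True                         # verified success; stop retrying
    return False                                # report failure after all retries
```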
Generative modeling of complex behaviors from labeled datasets has been a longstanding problem in decision making. Unlike language or image generation, decision making requires modeling actions - continuous-valued vectors that are multimodal in their distribution, potentially drawn from uncurated sources, where generation errors can compound in sequential prediction. A recent class of models called Behavior Transformers (BeT) addresses this by discretizing actions using k-means clustering to capture different modes. However, k-means struggles to scale to high-dimensional action spaces or long sequences and lacks gradient information, so BeT struggles to model long-range actions. In this work, we present Vector-Quantized Behavior Transformer (VQ-BeT), a versatile model for behavior generation that handles multimodal action prediction, conditional generation, and partial observations. VQ-BeT augments BeT by tokenizing continuous actions with a hierarchical vector quantization module. Across seven environments including simulated manipulation, autonomous driving, and robotics, VQ-BeT improves on state-of-the-art models such as BeT and Diffusion Policies. Importantly, we demonstrate VQ-BeT’s improved ability to capture behavior modes while accelerating inference speed 5x over Diffusion Policies. Videos and code can be found at https://sjlee.cc/vq-bet/.
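The core trick, tokenizing actions with a hierarchical (residual) vector quantizer, fits in a few lines. This toy version uses fixed random codebooks purely to show the encode/decode path; in VQ-BeT the codebooks are learned, and the transformer then predicts the resulting discrete codes. All names here are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)
action_dim, codebook_size = 7, 16
codebooks = [rng.normal(size=(codebook_size, action_dim)) for _ in range(2)]

def quantize(action):
    """Encode an action as one discrete code per level: the first codebook
    captures the coarse mode, the second quantizes the leftover residual."""
    residual, codes = np.asarray(action, dtype=float), []
    for book in codebooks:
        idx = int(np.argmin(np.linalg.norm(book - residual, axis=1)))
        codes.append(idx)
        residual = residual - book[idx]
    return codes

def decode(codes):
    """Reconstruct the continuous action as the sum of the chosen codes."""
    return sum(book[i] for book, i in zip(codebooks, codes))

action = rng.normal(size=action_dim)
codes = quantize(action)
print(codes, np.linalg.norm(action - decode(codes)))  # residual shrinks with depth
```

Because each level refines the previous one's residual, the scheme keeps codebooks small while still covering a high-dimensional, multimodal action space.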
Remarkable progress has been made in recent years in the fields of vision, language, and robotics. We now have vision models capable of recognizing objects based on language queries, navigation systems that can effectively control mobile systems, and grasping models that can handle a wide range of objects. Despite these advancements, general-purpose applications of robotics still lag behind, even though they rely on these fundamental capabilities of recognition, navigation, and grasping. In this paper, we adopt a systems-first approach to develop a new Open Knowledge-based robotics framework called OK-Robot. By combining Vision-Language Models (VLMs) for object detection, navigation primitives for movement, and grasping primitives for object manipulation, OK-Robot offers an integrated solution for pick-and-drop operations without requiring any training. To evaluate its performance, we run OK-Robot in 10 real-world home environments. The results demonstrate that OK-Robot achieves a 58.5% success rate in open-ended pick-and-drop tasks, representing a new state-of-the-art in Open Vocabulary Mobile Manipulation (OVMM) with nearly 1.8x the performance of prior work. On cleaner, uncluttered environments, OK-Robot’s performance increases to 82%. However, the most important insight gained from OK-Robot is the critical role of nuanced details when combining Open Knowledge systems like VLMs with robotic modules. Videos of our experiments are available on our website: https://ok-robot.github.io.
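Since OK-Robot is training-free, its control flow is essentially a pipeline over pretrained modules. The sketch below is my schematic of that pipeline; every callable is a placeholder for an off-the-shelf component (a VLM-backed detector, a navigation primitive, a grasping primitive), not OK-Robot's actual interface.

```python
def pick_and_drop(robot, pick_query, drop_query, detect, navigate, grasp):
    """detect: text -> 3D target from the scene representation (VLM-backed);
    navigate, grasp: pretrained primitives. Nothing here is trained."""
    target = detect(pick_query)         # e.g. "the blue mug on the table"
    navigate(robot, target)             # move the base near the object
    grasp(robot, target)                # pick it up with the grasping model
    destination = detect(drop_query)    # e.g. "the kitchen sink"
    navigate(robot, destination)
    robot.open_gripper()                # release the object to finish the drop
```

The abstract's main lesson lives in the seams of exactly this kind of pipeline: each hand-off between modules is where the "nuanced details" make or break the system.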
Throughout history, we have successfully integrated various machines into our homes. Dishwashers, laundry machines, stand mixers, and robot vacuums are a few recent examples. However, these machines excel at performing only a single task effectively. The concept of a “generalist machine” in homes - a domestic assistant that can adapt and learn from our needs, all while remaining cost-effective - has long been a goal in robotics that has been steadily pursued for decades. In this work, we initiate a large-scale effort towards this goal by introducing Dobb-E, an affordable yet versatile general-purpose system for learning robotic manipulation within household settings. Dobb-E can learn a new task with only five minutes of a user showing it how to do it, thanks to a demonstration collection tool (“The Stick”) we built out of cheap parts and iPhones. We use the Stick to collect 13 hours of data in 22 homes of New York City, and train Home Pretrained Representations (HPR). Then, in a novel home environment, with five minutes of demonstrations and fifteen minutes of adapting the HPR model, we show that Dobb-E can reliably solve the task on the Stretch, a mobile robot readily available on the market. Across roughly 30 days of experimentation in homes of New York City and surrounding areas, we test our system in 10 homes, with a total of 109 tasks in different environments, and finally achieve a success rate of 81%. Beyond success percentages, our experiments reveal a plethora of unique challenges absent or ignored in lab robotics. These range from the effects of strong shadows to variable demonstration quality from non-expert users. With the hope of accelerating research on home robots, and eventually seeing robot butlers in every home, we open-source the Dobb-E software stack and models, our data, and our hardware designs on our website: https://dobb-e.com.
While large-scale sequence modeling from offline data has led to impressive performance gains in natural language and image generation, directly translating such ideas to robotics has been challenging. One critical reason for this is that uncurated robot demonstration data, i.e. play data, collected from non-expert human demonstrators are often noisy, diverse, and distributionally multi-modal. This makes extracting useful, task-centric behaviors from such data a difficult generative modeling problem. In this work, we present Conditional Behavior Transformers (C-BeT), a method that combines the multi-modal generation ability of Behavior Transformer with future-conditioned goal specification. On a suite of simulated benchmark tasks, we find that C-BeT improves upon prior state-of-the-art work in learning from play data by an average of 45.7%. Further, we demonstrate for the first time that useful task-centric behaviors can be learned on a real-world robot purely from play data without any task labels or reward information. Robot videos are best viewed on our project website: https://play-to-policy.github.io.
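One way to see what "future-conditioned goal specification" buys you: goals can be mined from the play data itself, with no task labels or rewards. Here is a toy sketch of that hindsight relabeling, under my own illustrative names rather than the paper's code.

```python
import random

def relabel_with_future_goals(trajectory, horizon=32):
    """trajectory: list of (observation, action) pairs from unlabeled play.
    Returns (obs, goal_obs, action) triples: a frame sampled a few steps
    ahead in the same trajectory serves as the goal for the current step."""
    triples = []
    for t, (obs, action) in enumerate(trajectory[:-1]):
        g = random.randint(t + 1, min(t + horizon, len(trajectory) - 1))
        goal_obs = trajectory[g][0]   # hindsight goal: where the play ended up
        triples.append((obs, goal_obs, action))
    return triples
```

Training a multimodal policy on such triples is what lets C-BeT turn undirected play into task-directed behavior at test time: you simply hand it the goal observation you want.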
We propose CLIP-Fields, an implicit scene model that can be trained with no direct human supervision. This model learns a mapping from spatial locations to semantic embedding vectors. The mapping can then be used for a variety of tasks, such as segmentation, instance identification, semantic search over space, and view localization. Most importantly, the mapping can be trained with supervision coming only from web-image and web-text trained models such as CLIP, Detic, and Sentence-BERT. Compared to baselines like Mask-RCNN, our method performs better on few-shot instance identification and semantic segmentation on the HM3D dataset with only a fraction of the examples. Finally, we show that using CLIP-Fields as a scene memory, robots can perform semantic navigation in real-world environments.
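At its core, the implicit scene model is a coordinate network: a small MLP from an xyz location to a semantic embedding, fit against the features that pretrained web models assign to the pixels observing that point. The sketch below is a simplification under that reading; positional encodings, the contrastive objective, and the Detic-derived labels are all omitted, and the architecture is my own stand-in.

```python
import torch
import torch.nn as nn

# xyz coordinate in, 512-d feature out, to match a CLIP-like embedding space
field = nn.Sequential(
    nn.Linear(3, 256), nn.ReLU(),
    nn.Linear(256, 256), nn.ReLU(),
    nn.Linear(256, 512),
)
optimizer = torch.optim.Adam(field.parameters(), lr=1e-3)

def train_step(points, target_features):
    """points: (N, 3) world coordinates of observed pixels; target_features:
    (N, 512) unit-norm features those pixels received from pretrained models."""
    pred = nn.functional.normalize(field(points), dim=-1)
    loss = 1.0 - (pred * target_features).sum(dim=-1).mean()   # cosine loss
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```

Once trained, "semantic search over space" is just scoring candidate locations' predicted embeddings against a text embedding and navigating to the argmax.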
While behavior learning has made impressive progress in recent times, it lags behind computer vision and natural language processing due to its inability to leverage large, human-generated datasets. Human behavior has a wide variance, multiple modes, and human demonstrations naturally do not come with reward labels. These properties limit the applicability of current methods in Offline RL and Behavioral Cloning to learn from large, pre-collected datasets. In this work, we present Behavior Transformer (BeT), a new technique to model unlabeled demonstration data with multiple modes. BeT retrofits standard transformer architectures with action discretization coupled with a multi-task action correction inspired by offset prediction in object detection. This allows us to leverage the multi-modal modeling ability of modern transformers to predict multi-modal continuous actions. We experimentally evaluate BeT on a variety of robotic manipulation and self-driving behavior datasets. We show that BeT significantly improves over prior state-of-the-art work on solving demonstrated tasks while capturing the major modes present in the pre-collected datasets. Finally, through an extensive ablation study, we further analyze the importance of every crucial component in BeT. Videos of behavior generated by BeT are available at https://mahis.life/bet/.
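The discretization-plus-offset scheme is easy to illustrate. The sketch below uses scikit-learn's k-means on made-up data and shows only the encode/decode path; in BeT itself, a transformer predicts the bin with a classification head and the offset with a regression head.

```python
import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(0)
actions = rng.normal(size=(1000, 4))        # stand-in for dataset actions
kmeans = KMeans(n_clusters=8, n_init=10).fit(actions)

def encode(action):
    """Split a continuous action into (discrete bin, continuous offset)."""
    bin_id = int(kmeans.predict(action[None])[0])
    offset = action - kmeans.cluster_centers_[bin_id]
    return bin_id, offset

def decode(bin_id, offset):
    """Adding the predicted offset to the bin center recovers the action."""
    return kmeans.cluster_centers_[bin_id] + offset

bin_id, offset = encode(actions[0])
assert np.allclose(decode(bin_id, offset), actions[0])
```

The bins give the model its discrete, multimodal "which mode am I in" choice, while the offsets restore the continuous precision that pure discretization would throw away.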
Code releases from my research projects and open-source projects that I've contributed to.
Articles and essays I've written, mostly as part of the MIT student newspaper, and a few in my own leisure time.
Topics that intrigue me and that I am continuously working to learn more about.
Randomly chosen favorite quotes from Goodreads.