Lerrel Pinto
Assistant Professor of CS @nyuniversity.
I like robots!
- Reposted by Lerrel Pinto: How is AI helping robots to generalise their skills to unfamiliar environments? 🤖 🏠 In the latest episode, I chatted to Prof. Lerrel Pinto (@lerrelpinto.com) from New York University about #robot learning and decision making. Available wherever you get your podcasts: linktr.ee/robottalkpod
- We just released RUKA, a $1300 humanoid hand that is 3D-printable, strong, precise, and fully open-sourced! The key technical breakthrough is that we can control the robot's joints and fingertips **without joint encoders**. All we need is self-supervised data collection and learning.
- This project, which combines hardware design with learning-based controllers, was a monumental effort led by @anyazorin.bsky.social and Irmak Guzey. More links and information about RUKA are below: Website: ruka-hand.github.io Assembly Instructions: ruka.gitbook.io/instructions
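For intuition on the "no joint encoders" point above, here is a hedged sketch of the idea: learn a controller that maps desired fingertip positions to motor commands from self-collected data. Names, shapes, and training details below are illustrative assumptions, not the RUKA codebase.

```python
# Hedged sketch: instead of reading joint encoders, learn a controller from
# self-collected (motor command, measured fingertip) pairs, e.g. gathered with
# a motion-capture glove. Shapes and names are illustrative, not RUKA's code.
import torch
import torch.nn as nn

controller = nn.Sequential(            # desired fingertip (x, y, z) -> motor commands
    nn.Linear(3, 256), nn.ReLU(),
    nn.Linear(256, 256), nn.ReLU(),
    nn.Linear(256, 2),                 # e.g. two tendon motors for one finger
)
optimizer = torch.optim.Adam(controller.parameters(), lr=1e-3)

def train_step(motor_cmds: torch.Tensor, fingertips: torch.Tensor) -> float:
    """motor_cmds: (B, 2) commands sent during data collection;
    fingertips: (B, 3) fingertip positions measured for those commands."""
    loss = nn.functional.mse_loss(controller(fingertips), motor_cmds)
    optimizer.zero_grad(); loss.backward(); optimizer.step()
    return loss.item()
```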
- When life gives you lemons, you pick them up. (trained with robotutilitymodels.com)
- Is there a word for the feeling when you want to cheer for the other team?
- The robot behaviors shown below are trained without any teleop, sim2real, genai, or motion planning. Simply show the robot a few examples of doing the task yourself, and our new method, called Point Policy, spits out a robot-compatible policy!
- Point Policy uses sparse key points to represent both human demonstrators and robots, bridging the morphology gap. The scene is thus encoded through semantically meaningful key points derived from minimal human annotation.
- The overall algorithm is simple: 1. Extract key points from human videos. 2. Train a transformer policy to predict future robot key points. 3. Convert predicted key points to robot actions.
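A minimal sketch of those three steps, with hypothetical class and function names (this is not the released implementation; see point-policy.github.io for that):

```python
# Illustrative sketch of the Point Policy pipeline; names are hypothetical.
import torch
import torch.nn as nn

class PointTransformerPolicy(nn.Module):
    """Predicts future robot key-point positions from a history of key points."""
    def __init__(self, num_points: int, dim: int = 256, horizon: int = 8):
        super().__init__()
        self.embed = nn.Linear(num_points * 3, dim)            # flatten (x, y, z) per point
        layer = nn.TransformerEncoderLayer(dim, nhead=8, batch_first=True)
        self.trunk = nn.TransformerEncoder(layer, num_layers=4)
        self.head = nn.Linear(dim, horizon * num_points * 3)   # future key points
        self.num_points, self.horizon = num_points, horizon

    def forward(self, point_history: torch.Tensor) -> torch.Tensor:
        # point_history: (batch, time, num_points, 3), extracted from human videos
        b, t, n, _ = point_history.shape
        tokens = self.embed(point_history.reshape(b, t, n * 3))
        fused = self.trunk(tokens)[:, -1]                      # summary of the history
        return self.head(fused).reshape(b, self.horizon, n, 3)

# 1. Extract key points from human videos (e.g., with an off-the-shelf point tracker).
# 2. Train the policy above to predict future robot key points from past ones.
# 3. Convert predicted key points to robot actions, e.g., via inverse kinematics.
```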
- This project was an almost solo effort from @haldarsiddhant.bsky.social. And as always, this project is fully open-sourced. Project page: point-policy.github.io Paper: arxiv.org/abs/2502.20391
- Reposted by Lerrel Pinto: This is important because the humble iPhone is one of the best accessories for embodied AI out there, if not actually the best. It's got a depth sensor, good camera, built-in internet, decent compute, and -- uniquely -- it has really good slam already built in.
- We just released AnySense, an iPhone app for effortless data acquisition and streaming for robotics. We leverage Apple’s development frameworks to record and stream: 1. RGBD + Pose data 2. Audio from the mic or custom contact microphones 3. Seamless Bluetooth integration for external sensors
- The 'wild' robot data collected by AnySense can then be used to train multimodal policies! In the video above, we use the Robot Utility Models framework to train visuo-tactile policies for a whiteboard-erasing task. You can use it for so much more, though!
- AnySense is built to empower researchers with better tools for robotics. Try it out below. Download on the App Store: apps.apple.com/us/app/anyse... Open-source code on GitHub: github.com/NYU-robot-le... Website: anysense.app AnySense is led by @raunaqb.bsky.social with several collaborators from NYU.
- Just found a new winner for the most hype-baiting, unscientific plot I have seen. (From the recent Figure AI release)
- Reposted by Lerrel Pinto: One reason to be intolerant of misleading hype in tech and science is that tolerating the small lies and deception is how you get tolerance of big lies
- Thank you to @sloanfoundation.bsky.social for this generous award to our lab. Hopefully this will bring us closer to building truly general-purpose robots!
- 🎉Congrats to the 126 early-career scientists who have been awarded a Sloan Research Fellowship this year! These exceptional scholars are drawn from 51 institutions across the US and Canada, and represent the next generation of groundbreaking researchers. sloan.org/fellowships/...
- A fun, clever idea from @upiter.bsky.social: treat code generation as a sequential editing problem -- this gives you loads of training data from synthetically editing existing code. And it works! Higher performance on HumanEval, MBPP, and CodeContests across small LMs like Gemma-2, Phi-3, and Llama 3.1
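A hedged sketch of how existing code can be turned into synthetic edit-sequence training data; the actual procedure in the paper may differ, and the function below is purely illustrative.

```python
# Illustrative sketch: rebuild a finished program chunk by chunk and record
# each intermediate state together with the diff that produces the next state.
# An LM trained on these pairs learns to generate code as a sequence of edits.
import difflib

def synthetic_edit_sequence(final_code: str, chunks: int = 4):
    """Yield (partial_program, next_edit) pairs that rebuild final_code."""
    lines = final_code.splitlines()
    step = max(1, len(lines) // chunks)
    state: list[str] = []
    for end in range(step, len(lines) + step, step):
        new_state = lines[:end]
        diff = "\n".join(difflib.unified_diff(state, new_state, lineterm=""))
        yield "\n".join(state), diff   # train the LM to emit this edit next
        state = new_state
```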
- We have been working a bunch on offline world models. Pre-trained features from DINOv2 seem really powerful for modeling. I hope this opens up a whole set of applications for decision making and robotics! Check out the thread from @gaoyuezhou.bsky.social for more details.
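As a rough illustration of what an offline world model on top of frozen DINOv2 features could look like (the architecture and names here are my own sketch, not the released model):

```python
# Hedged sketch: predict the next frame's DINOv2 features from the current
# frame's features and the action, with the DINOv2 encoder kept frozen.
import torch
import torch.nn as nn

class LatentWorldModel(nn.Module):
    def __init__(self, feat_dim: int = 768, action_dim: int = 7, hidden: int = 1024):
        super().__init__()
        self.encoder = torch.hub.load("facebookresearch/dinov2", "dinov2_vitb14")
        self.encoder.requires_grad_(False)                     # keep DINOv2 frozen
        self.dynamics = nn.Sequential(
            nn.Linear(feat_dim + action_dim, hidden), nn.GELU(),
            nn.Linear(hidden, feat_dim),
        )

    def forward(self, frame: torch.Tensor, action: torch.Tensor) -> torch.Tensor:
        # frame: (B, 3, 224, 224) image batch; action: (B, action_dim)
        with torch.no_grad():
            z = self.encoder(frame)                            # (B, feat_dim) class-token features
        return self.dynamics(torch.cat([z, action], dim=-1))   # predicted next features
```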
- Reposted by Lerrel Pinto: omg a student somehow accidentally wrote an email addressed to a faculty-wide NYU listserv and my inbox is now a master class on who understands the difference between a listserv and an email chain
- Reposted by Lerrel Pinto: Humans vs Ants: Problem-solving Skills
- At NYU Abu Dhabi today and in love with how cat-friendly the campus is!
- Reposted by Lerrel Pinto: This holiday season, take a moment to visit your local bookstore. It’s about more than finding a great book—it’s about supporting the small businesses that keep our communities thriving.
- Reposted by Lerrel Pinto: HOT 🔥 fastest, most precise, and most capable hand control setup ever... Less than $450 and fully open-source 🤯 by @huggingface, @therobotstudio, @NepYope This tendon-driven technology will disrupt robotics! Retweet to accelerate its democratization 🚀 A thread 🧵
- Reposted by Lerrel Pinto: Outstanding presentation, finally! DynaMo: In-Domain Dynamics Pretraining for Visuo-Motor Control @jeffacce.bsky.social @lerrelpinto.com
- Reposted by Lerrel Pinto: Love this approach. Reminds me of a more detailed version of an idea I had. Will definitely look deeper into this ironj.github.io/eleuther/
- New paper! We show that by using keypoint-based image representations, robot policies become robust to different object types and background changes. We call this method Prescriptive Point Priors for robot Policies, or P3-PO for short. Full project is here: point-priors.github.io
- P3-PO uses a one-time “point prescription” by a human to identify key points. After this, it uses semantic correspondence to find the same points on different instances of the same object.
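A hedged sketch of the semantic-correspondence step: match the prescribed points to a new image by nearest neighbour in dense vision features. The feature extractor and names are illustrative assumptions, not the P3-PO code.

```python
# Illustrative sketch: transfer prescribed points to a new image by cosine
# similarity between dense feature maps (e.g. from a DINO-style backbone).
import torch
import torch.nn.functional as F

def transfer_points(ref_feats, new_feats, ref_points):
    """ref_feats, new_feats: (C, H, W) dense features; ref_points: list of (row, col)."""
    C, H, W = new_feats.shape
    flat = F.normalize(new_feats.reshape(C, -1), dim=0)        # (C, H*W), unit columns
    matched = []
    for (r, c) in ref_points:
        query = F.normalize(ref_feats[:, r, c], dim=0)         # (C,) prescribed point feature
        sims = query @ flat                                     # cosine similarity map
        idx = int(sims.argmax())
        matched.append((idx // W, idx % W))                     # best-matching pixel
    return matched
```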
- This work was led by @maralevy.bsky.social and a wonderful collaboration with @haldarsiddhant.bsky.social and @abhinav-sh.bsky.social !
- Modern policy architectures are unnecessarily complex. In our #NeurIPS2024 project called BAKU, we focus on what really matters for good policy learning. BAKU is modular, language-conditioned, compatible with multiple sensor streams and action multi-modality, and, importantly, fully open-source!
- BAKU consists of three modules: 1. Sensor encoders for vision, language, and state 2. Observation trunk to fuse multimodal inputs 3. Action head for predicting actions. This allows BAKU to combine different action models like VQ-BeT and Diffusion Policy under one framework.
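A minimal sketch of that modular layout; the module interfaces are illustrative, see baku-robot.github.io for the released implementation.

```python
# Illustrative sketch of a BAKU-like modular policy: swappable encoders,
# a fusion trunk, and a pluggable action head.
import torch
import torch.nn as nn

class BAKULikePolicy(nn.Module):
    def __init__(self, vision_enc, lang_enc, state_enc, trunk, action_head):
        super().__init__()
        # 1. Sensor encoders for vision, language, and proprioceptive state
        self.vision_enc, self.lang_enc, self.state_enc = vision_enc, lang_enc, state_enc
        # 2. Observation trunk that fuses the multimodal tokens
        self.trunk = trunk
        # 3. Action head (swappable: MLP, VQ-BeT, Diffusion Policy, ...)
        self.action_head = action_head

    def forward(self, images, instruction, state):
        tokens = torch.stack(
            [self.vision_enc(images), self.lang_enc(instruction), self.state_enc(state)],
            dim=1,
        )                                    # (B, 3, D): one token per modality
        fused = self.trunk(tokens)[:, 0]     # pooled observation embedding
        return self.action_head(fused)       # predicted action (or action chunk)
```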
- More details are here: baku-robot.github.io BAKU was led by @haldarsiddhant.bsky.social, who will be presenting at NeurIPS this Thursday from 11 a.m. to 2 p.m. PST. So catch him if you're around!
- Reposted by Lerrel Pinto: Robot utility models are not just among the first learned models that work zero-shot on a mobile manipulator, but also provide a nuanced discussion on what works and what doesn't in data-driven robot learning.
- Since we are nearing the end of the year, I'll revisit some of the work I'm most excited about from the last year, and maybe give a sneak peek of what we are up to next. To start off: Robot Utility Models, which enables zero-shot deployment. In the video below, the robot hasn't seen these doors before.
- There are three main components to building RUMs: diverse expert data + multi-modal behavior cloning + mLLM feedback. Hardware, code & pretrained policies are fully open-sourced: robotutilitymodels.com
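One way to read the "mLLM feedback" component is a deploy-verify-retry loop. The sketch below is an assumption about what that loop looks like, with illustrative callbacks rather than the released robotutilitymodels.com API.

```python
# Hedged sketch: run the behavior-cloned policy, ask a multimodal LLM whether
# the task succeeded, and retry from the home pose if it did not.
from typing import Any, Callable

def run_with_mllm_feedback(
    policy: Callable[[Any], Any],         # observation -> action
    step: Callable[[Any], Any],           # action -> next observation
    reset: Callable[[], Any],             # return robot to home pose, give first observation
    mllm_success: Callable[[Any], bool],  # ask an mLLM, e.g. "is the door open?", from an image
    max_steps: int = 200,
    max_retries: int = 3,
) -> bool:
    for _ in range(max_retries):
        obs = reset()
        for _ in range(max_steps):
            obs = step(policy(obs))
        if mllm_success(obs):
            return True                   # mLLM confirms success, stop
    return False                          # give up after max_retries attempts
```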
- Our awesome undergrad lead on this project @haritheja.bsky.social took RUMs to Munich for CoRL 2024 and showed it working zero-shot, opening doors and drawers bought from a German IKEA.
- RUMs is the brainchild of @notmahi.bsky.social, backed by several insightful experiments. The most important finding: data diversity >> data quantity. Another insight is that, regardless of the algorithm, there is a similar-ish scaling law across tasks. Check out the paper: arxiv.org/abs/2409.05865
- Reposted by Lerrel Pinto: I'd like to introduce what I've been working at @hellorobot.bsky.social: Stretch AI, a set of open-source tools for language-guided autonomy, exploration, navigation, and learning from demonstration. Check it out: github.com/hello-robot/... Thread ->
- Reposted by Lerrel Pinto: How to drive your research forward? “I tested the idea we discussed last time. Here are some results. It does not work. (… awkward silence)” Such conversations happen so many times when meeting with students. How do we move forward? You need …
- I think we need an AMA series for Robotics / Embodied AI with an optional anonymous setting. It would be both fun and informative for new community members looking to absorb folk knowledge.
- Reposted by Lerrel Pinto: I collected some folk knowledge for RL and stuck them in my lecture slides a couple weeks back: web.mit.edu/6.7920/www/l... See Appendix B... sorry, I know, appendix of a lecture slide deck is not the best for discovery. Suggestions very welcome.
- Reposted by Lerrel Pinto: Interesting article but the author drank the Kool-Aid and never sought out other viewpoints: “Foundation models like GPT-4 have largely subsumed [previous] models that help robots with planning and vision, and locomotion and dexterity will probably soon be subsumed, too.”
- Nice work Remi and team. We need more of this!