Chronological feed of everything captured from Pieter Abbeel.
paper / pabbeel / 1d ago
D-REX introduces a differentiable real-to-sim-to-real engine leveraging Gaussian Splat representations for robotic systems. This engine aims to bridge the simulation-to-real-world gap by enabling object mass identification from visual observations and control signals, while simultaneously facilitating grasping policy learning. It constructs high-fidelity digital twins by optimizing object mass and incorporates a novel method for training force-aware grasping policies using transferred human demonstrations.
differentiable-simulationroboticsdexterous-graspingreal-to-sim-to-realphysical-parameter-identificationgrasping-policy-learninggaussian-splat-representations
“D-REX utilizes a differentiable real-to-sim-to-real engine incorporating Gaussian Splat representations.”
paper / pabbeel / 1d ago
This paper presents a two-stage learning framework for fine-grained robotic manipulation tasks with subjective success criteria, using knife-peeling as a case study. The approach combines force-aware imitation learning for robust initial policy generation with preference-based finetuning using a learned reward model that incorporates human feedback, resulting in high success rates and strong generalization across various produce. This method demonstrates a viable pathway for robots to master complex, dexterous tasks requiring qualitative assessment.
roboticsimitation-learningpreference-learninghuman-robot-interactionfine-grained-manipulationrobot-learning
“Robotic manipulation tasks with implicit, continuous, and subjective success criteria are challenging for autonomous robots.”
paper / pabbeel / 1d ago
Traditional reinforcement learning agents struggle with reward misspecification and adapting to changing preferences because they are trained on a single, fixed reward function. Reward-Conditioned Reinforcement Learning (RCRL) addresses this limitation by training a single agent to optimize a family of reward specifications. RCRL leverages off-policy learning from shared replay data to enable a single policy to represent reward-specific behaviors, improving performance and facilitating efficient adaptation across diverse tasks.
reinforcement-learningreward-conditioned-rloff-policy-learningmulti-task-learningrobust-policiesmachine-learning-research
“Traditional RL agents are brittle to reward misspecification and have limited adaptability due to fixed reward functions.”
paper / pabbeel / 1d ago
CliqueFlowmer addresses the limitations of maximum likelihood-based generative models in computational materials discovery (CMD) by employing offline model-based optimization (MBO). By integrating clique-based MBO into a transformer and flow-based generation architecture, the model enables the direct optimization of target material properties. Empirical validation indicates that this approach significantly outperforms traditional generative baselines in discovering high-performance materials.
materials-discoverydeep-learningmodel-based-optimizationgenerative-modelstransformer-networkscomputational-materials-design
“Standard generative modeling methods are ineffective at exploring optimal regions of materials space due to maximum likelihood training.”
paper / pabbeel / 1d ago
XL-VLA introduces a novel vision-language-action framework that utilizes a unified, embodiment-invariant latent action space. This approach enables scalable cross-embodiment training and efficient data reuse for dexterous manipulation tasks, addressing the challenge of costly data collection for diverse robotic hands. The model consistently outperforms baseline VLA models operating in raw joint spaces.
roboticsdexterous-manipulationvision-language-action-modelscross-embodiment-learninglatent-representation
“Traditional VLA models for dexterous manipulation are hampered by the need for extensive data collection for each new robotic hand embodiment.”
tweet / @pabbeel / Feb 25
Pieter Abbeel, a prominent AI researcher, has indicated a past collaboration with an individual named David and expressed anticipation for future endeavors. The nature of the collaboration and future plans are not disclosed in this brief message.
social-mediafarewellprofessional-networking
“Pieter Abbeel has previously collaborated with an individual named David.”
tweet / @pabbeel / Jan 16
The content is a very brief social media post expressing congratulations. It lacks substantive information, making it impossible to extract any meaningful insights or technical details. The post is purely celebratory and devoid of any falsifiable claims or data.
congratulationscommunitysocial-media
“Pieter Abbeel congratulated Deepak.”
tweet / @pabbeel / Dec 19 / failed
tweet / @pabbeel / Dec 18
This content is an empty congratulatory message about a body of research without any specific details to extract. It contains no actionable insights or identifiable claims about the research itself.
congratulationsresearchx-feedpieter-abbeel
tweet / @pabbeel / Dec 1
Amazon FAR has open-sourced Holosoma, a comprehensive robotics platform designed to address the full-stack challenges of sim-to-real learning for humanoid robots. Holosoma provides a unified framework supporting multiple simulation backends (IsaacGym, IsaacSim, MJWarp), various robots (humanoid and quadruped), and efficient reinforcement learning algorithms. Its modular architecture and open inference pipeline aim to lower the barrier to entry for robotics research by enabling rapid iteration and seamless transfer from simulation to real-world deployment.
roboticsopen-sourcehumanoidssimulationreinforcement-learningsim-to-realrobot-locomotion
“Holosoma is a full-stack open-source solution for sim-to-real learning in humanoid robotics.”
tweet / @pabbeel / Dec 1
Pieter Abbeel, a prominent figure in AI, expressed a positive sentiment with the single word 'beautiful!' in response to unspecified content on his X (formerly Twitter) feed. This reaction, while brief, indicates a favorable impression without providing specific details or technical insights into the subject matter.
user-notesocial-mediapieter-abbeelsentiment-analysispositive-sentiment
“Pieter Abbeel posted the word 'beautiful!' on his X feed.”
tweet / @pabbeel / Nov 27
Meta, in collaboration with Microsoft, released Llama 2 on July 18, 2023. This release includes models ranging from 7B to 70B parameters, making advanced large language models more accessible.
llama-2llm-releasemeta-aimicrosoft-ai70b-model
“Llama 2 was released on July 18, 2023.”
tweet / @pabbeel / Nov 25
The provided content is extremely brief and lacks substantive information. It consists only of an interjection and a note about automatic ingestion. Therefore, it is impossible to extract meaningful insights, key claims, or a detailed synthesis.
pieter-abbeelsocial-mediaimpressions-reactions
tweet / @pabbeel / Oct 21
The provided content consists of a single-word reaction ('impressive') to an external piece of media. It contains no technical data, assertions, or substantive information suitable for knowledge extraction.
x-feedpieter-abbeelsocial-mediaai
tweet / @pabbeel / Oct 21
GaussGym offers significant improvements to environments used for training AI in locomotion. This advancement is expected to facilitate more effective and efficient development of robotic and simulated agents capable of complex movement.
locomotion-trainingai-environmentsrobotics
“GaussGym provides an upgrade for environments used in training locomotion capabilities.”
youtube / pabbeel / Oct 7
Deploying machine learning models in real-world robotics presents unique challenges beyond typical software applications, primarily due to the stringent reliability requirements. Unlike spam filters where partial success is valuable, robot failures often incur significant costs or necessitate extensive human intervention. Achieving commercially viable performance (99.5-99.9% reliability) demands a blend of scientific breakthroughs in AI and meticulous engineering, focusing on robust data collection, model architectures, and loss functions.
roboticsreinforcement-learningimitation-learningmachine-learning-in-productiondata-collectionrobot-manipulationai-startups
“Robotics demands exceptionally high reliability (99.5-99.9%) for commercial viability, unlike many software-only AI applications.”
youtube / pabbeel / Aug 8
Pieter Abbeel, a leading AI and robotics researcher, discusses the current state and future trajectory of artificial intelligence, emphasizing the transition from purely academic research to real-world applications. He highlights the role of AI in transforming industries like logistics and manufacturing and addresses the challenges and opportunities in democratizing AI capabilities beyond large tech companies.
ai-roboticsmachine-learningstartupsai-policydata-sciencefuture-of-aicareer-development
“Current robotics applications are primarily in structured manufacturing environments performing repetitive tasks.”
youtube / pabbeel / Apr 19
Pieter Abbeel discusses the transition of AI robotics from research labs and simulations to practical real-world applications, focusing on the need for increased intelligence and adaptability in robots. He highlights the distinction between core academic research, which prioritizes pure learning approaches, and real-world deployment, where incorporating prior knowledge and robustness is crucial for reliability and commercial value. Abbeel emphasizes the importance of generalized learning systems that can handle diverse and dynamic environments, moving beyond pre-programmed motions to cognitive, reactive robotic intelligence.
ai-roboticsreinforcement-learningunsupervised-learningindustrial-automationtransformer-modelsmultimodal-ai
“AI robotics is transitioning from simulation-based research to real-world applications, especially in logistics and warehousing.”
youtube / pabbeel / Dec 16 / failed
youtube / pabbeel / Jun 22 / failed