absorb.md — A knowledge graph of what AI thinkers are actually saying

paper / pabbeel / Apr 10

D-REX: Differentiable Real-to-Sim-to-Real Engine for Dexterous Grasping

D-REX introduces a differentiable real-to-sim-to-real engine leveraging Gaussian Splat representations for robotic systems. This engine aims to bridge the simulation-to-real-world gap by enabling object mass identification from visual observations and control signals, while simultaneously facilitating grasping policy learning. It constructs high-fidelity digital twins by optimizing object mass and incorporates a novel method for training force-aware grasping policies using transferred human demonstrations.

differentiable-simulationroboticsdexterous-graspingreal-to-sim-to-realphysical-parameter-identificationgrasping-policy-learninggaussian-splat-representations

“D-REX utilizes a differentiable real-to-sim-to-real engine incorporating Gaussian Splat representations.”

paper / pabbeel / Apr 10

Robotic Peeling with Human Preference Alignment

This paper presents a two-stage learning framework for fine-grained robotic manipulation tasks with subjective success criteria, using knife-peeling as a case study. The approach combines force-aware imitation learning for robust initial policy generation with preference-based finetuning using a learned reward model that incorporates human feedback, resulting in high success rates and strong generalization across various produce. This method demonstrates a viable pathway for robots to master complex, dexterous tasks requiring qualitative assessment.

roboticsimitation-learningpreference-learninghuman-robot-interactionfine-grained-manipulationrobot-learning

“Robotic manipulation tasks with implicit, continuous, and subjective success criteria are challenging for autonomous robots.”

paper / pabbeel / Apr 10

Reward-Conditioned Reinforcement Learning for Adaptive Policies

Traditional reinforcement learning agents struggle with reward misspecification and adapting to changing preferences because they are trained on a single, fixed reward function. Reward-Conditioned Reinforcement Learning (RCRL) addresses this limitation by training a single agent to optimize a family of reward specifications. RCRL leverages off-policy learning from shared replay data to enable a single policy to represent reward-specific behaviors, improving performance and facilitating efficient adaptation across diverse tasks.

reinforcement-learningreward-conditioned-rloff-policy-learningmulti-task-learningrobust-policiesmachine-learning-research

“Traditional RL agents are brittle to reward misspecification and have limited adaptability due to fixed reward functions.”

paper / pabbeel / Apr 10

Accelerating Materials Discovery via Clique-Based Offline Model-Based Optimization

CliqueFlowmer addresses the limitations of maximum likelihood-based generative models in computational materials discovery (CMD) by employing offline model-based optimization (MBO). By integrating clique-based MBO into a transformer and flow-based generation architecture, the model enables the direct optimization of target material properties. Empirical validation indicates that this approach significantly outperforms traditional generative baselines in discovering high-performance materials.

materials-discoverydeep-learningmodel-based-optimizationgenerative-modelstransformer-networkscomputational-materials-design

“Standard generative modeling methods are ineffective at exploring optimal regions of materials space due to maximum likelihood training.”

paper / pabbeel / Apr 10

XL-VLA: A Cross-Embodiment Latent Space for Dexterous Robot Manipulation

XL-VLA introduces a novel vision-language-action framework that utilizes a unified, embodiment-invariant latent action space. This approach enables scalable cross-embodiment training and efficient data reuse for dexterous manipulation tasks, addressing the challenge of costly data collection for diverse robotic hands. The model consistently outperforms baseline VLA models operating in raw joint spaces.

roboticsdexterous-manipulationvision-language-action-modelscross-embodiment-learninglatent-representation

“Traditional VLA models for dexterous manipulation are hampered by the need for extensive data collection for each new robotic hand embodiment.”

tweet / @pabbeel / Feb 25

Pieter Abbeel Announces Collaboration with David

Pieter Abbeel, a prominent AI researcher, has indicated a past collaboration with an individual named David and expressed anticipation for future endeavors. The nature of the collaboration and future plans are not disclosed in this brief message.

social-mediafarewellprofessional-networking

“Pieter Abbeel has previously collaborated with an individual named David.”

tweet / @pabbeel / Jan 16

Trivial Social Media Engagement

The content is a very brief social media post expressing congratulations. It lacks substantive information, making it impossible to extract any meaningful insights or technical details. The post is purely celebratory and devoid of any falsifiable claims or data.

congratulationscommunitysocial-media

“Pieter Abbeel congratulated Deepak.”

tweet / @pabbeel / Dec 19 / failed

@hallmarkchannel

tweet / @pabbeel / Dec 18

Empty Research Congratulations

This content is an empty congratulatory message about a body of research without any specific details to extract. It contains no actionable insights or identifiable claims about the research itself.

congratulationsresearchx-feedpieter-abbeel

tweet / @pabbeel / Dec 1

Amazon FAR Open-Sources Holosoma: A Full-Stack Robotics Platform for Sim-to-Real Transfer

Amazon FAR has open-sourced Holosoma, a comprehensive robotics platform designed to address the full-stack challenges of sim-to-real learning for humanoid robots. Holosoma provides a unified framework supporting multiple simulation backends (IsaacGym, IsaacSim, MJWarp), various robots (humanoid and quadruped), and efficient reinforcement learning algorithms. Its modular architecture and open inference pipeline aim to lower the barrier to entry for robotics research by enabling rapid iteration and seamless transfer from simulation to real-world deployment.

roboticsopen-sourcehumanoidssimulationreinforcement-learningsim-to-realrobot-locomotion

“Holosoma is a full-stack open-source solution for sim-to-real learning in humanoid robotics.”

tweet / @pabbeel / Dec 1

Pieter Abbeel's Positive Reaction to Unspecified Content

Pieter Abbeel, a prominent figure in AI, expressed a positive sentiment with the single word 'beautiful!' in response to unspecified content on his X (formerly Twitter) feed. This reaction, while brief, indicates a favorable impression without providing specific details or technical insights into the subject matter.

user-notesocial-mediapieter-abbeelsentiment-analysispositive-sentiment

“Pieter Abbeel posted the word 'beautiful!' on his X feed.”

tweet / @pabbeel / Nov 27

Meta Releases Llama 2 with Microsoft Collaboration

Meta, in collaboration with Microsoft, released Llama 2 on July 18, 2023. This release includes models ranging from 7B to 70B parameters, making advanced large language models more accessible.

llama-2llm-releasemeta-aimicrosoft-ai70b-model

“Llama 2 was released on July 18, 2023.”

tweet / @pabbeel / Nov 25

Empty Content Analysis

The provided content is extremely brief and lacks substantive information. It consists only of an interjection and a note about automatic ingestion. Therefore, it is impossible to extract meaningful insights, key claims, or a detailed synthesis.

pieter-abbeelsocial-mediaimpressions-reactions

tweet / @pabbeel / Oct 21

Insufficient Content for Knowledge Extraction

The provided content consists of a single-word reaction ('impressive') to an external piece of media. It contains no technical data, assertions, or substantive information suitable for knowledge extraction.

x-feedpieter-abbeelsocial-mediaai

tweet / @pabbeel / Oct 21

GaussGym Enhances Locomotion AI Training Environments

GaussGym offers significant improvements to environments used for training AI in locomotion. This advancement is expected to facilitate more effective and efficient development of robotic and simulated agents capable of complex movement.

locomotion-trainingai-environmentsrobotics

“GaussGym provides an upgrade for environments used in training locomotion capabilities.”

Pieter Abbeel

D-REX: Differentiable Real-to-Sim-to-Real Engine for Dexterous Grasping

Robotic Peeling with Human Preference Alignment

Reward-Conditioned Reinforcement Learning for Adaptive Policies

Accelerating Materials Discovery via Clique-Based Offline Model-Based Optimization

XL-VLA: A Cross-Embodiment Latent Space for Dexterous Robot Manipulation

Pieter Abbeel Announces Collaboration with David

Trivial Social Media Engagement

Empty Research Congratulations

Amazon FAR Open-Sources Holosoma: A Full-Stack Robotics Platform for Sim-to-Real Transfer

Pieter Abbeel's Positive Reaction to Unspecified Content

Meta Releases Llama 2 with Microsoft Collaboration

Empty Content Analysis

Insufficient Content for Knowledge Extraction

GaussGym Enhances Locomotion AI Training Environments

Pieter Abbeel: Deep Reinforcement Learning | Lex Fridman Podcast #10

Reinforcement Learning Deep Dive with Pieter Abbeel - #28