lerobot/act_aloha_sim_transfer_cube_human

我要开发同款
匿名用户2024年07月31日
49阅读
所属分类aiPytorch
开源地址https://modelscope.cn/models/ccc141/act_aloha_sim_transfer_cube_human
授权协议Apache License 2.0

作品详情

介绍

搬运自HuggingFace的lerobot项目,用于解决~~HF网络异常~~导致的模型无法访问问题
[modelrepo]: https://huggingface.co/lerobot/actalohasimtransfercubehuman

Model Card for ACT/AlohaTransferCube

Action Chunking Transformer Policy (as per Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware) trained for the AlohaTransferCube environment from gym-aloha.

demo

How to Get Started with the Model

See the LeRobot library (particularly the evaluation script) for instructions on how to load and evaluate this model.

Training Details

Trained with LeRobot@d747195.

The model was trained using LeRobot's training script and with the alohasimtransfercubehuman dataset, using this command:

python lerobot/scripts/train.py \
  hydra.job.name=act_aloha_sim_transfer_cube_human \
  hydra.run.dir=outputs/train/act_aloha_sim_transfer_cube_human \
  policy=act \
  policy.use_vae=true \
  env=aloha \
  env.task=AlohaTransferCube-v0 \
  dataset_repo_id=lerobot/aloha_sim_transfer_cube_human \
  training.eval_freq=10000 \
  training.log_freq=250 \
  training.offline_steps=100000 \
  training.save_model=true \
  training.save_freq=25000 \
  eval.n_episodes=50 \
  eval.batch_size=50 \
  wandb.enable=true \
  device=cuda

The training curves may be found at https://wandb.ai/alexander-soare/Alexander-LeRobot/runs/hjdard15.

This took about 2.5 hours to train on an Nvida RTX 3090.

Evaluation

The model was evaluated on the AlohaTransferCube environment from gym-aloha and compared to a similar model trained with the original ACT repository. Each episode marks a success if the cube is successfully picked by one robot arm and transferred to the other robot arm.

Here are the success rate results for 500 episodes worth of evaluation. The first row is the naive mean. The second row assumes a uniform prior and computes the beta posterior, then calculates the mean and lower/upper confidence bounds (with a 68.2% confidence interval centered on the mean). The "Theirs" column is for an equivalent model trained on the original ACT repository and evaluated on LeRobot (the model weights may be found in the original_act_repo branch of this respository). The results of each of the individual rollouts may be found in eval_info.json.

|Ours|Theirs -|-|- Success rate for 500 episodes (%) | 87.6 | 68.0 Beta distribution lower/mean/upper (%) | 86.0 / 87.5 / 88.9 | 65.8 / 67.9 / 70.0

The original code was heavily refactored, and some bugs were spotted along the way. The differences in code may account for the difference in success rate. Another possibility is that our simulation environment may use slightly different heuristics to evaluate success (we've observed that success is registered as soon as the second arm's gripper makes antipodal contact with the cube). Finally, one should observe that the in-training evaluation jumps up towards the end of training. This may need further investigation (Is it statistically significant? If so, what is the cause?).

声明:本文仅代表作者观点,不代表本站立场。如果侵犯到您的合法权益,请联系我们删除侵权资源!如果遇到资源链接失效,请您通过评论或工单的方式通知管理员。未经允许,不得转载,本站所有资源文章禁止商业使用运营!
下载安装【程序员客栈】APP
实时对接需求、及时收发消息、丰富的开放项目需求、随时随地查看项目状态

评论