Collecting Data with AIRBOT Play

1. Introduction

This tutorial is about demonstration (single or double arm manipulation) for raw data collecting, data converting (convert raw data to hdf5 file used by the learning algorithms) and data replaying.

Please ask the customer service for all available dependency packages and source codes files. We will gradually provide relevant download links in the future.

Docker Notation

If you want to install and use these packages in Docker, you should run your container with some necessary args, for example:
docker run -it --name airbot_play --network=host --privileged=true -v /lib/modules:/lib/modules,/tmp/.X11-unix:/tmp/.X11-unix -e DISPLAY=$DISPLAY ubuntu:20.04
And you probably can not install the deb packages when they are in /root. In this case, you can move the packages and the terminal to /tmp first:
mv airbot_*.deb /tmp/ && cd /tmp
In some Docker Containers, there is no sudo command, you can just remove it from the commands below.

2. Environment Setup

Supported Operation Systems

Ubuntu 20.04 LTS AMD64(x86_64)
Ubuntu 20.04 LTS ARM64

2.1 Data Collection and Convertion Environment

The core control package for data collection should be in the format of airbot_play_<version>_<arch>.deb. Install this package with the following command:

sudo apt update
sudo apt install python3-pip python3 udev kmod iproute2 libcanberra-gtk-module libcanberra-gtk3-module -y
sudo service udev restart && udevadm control --reload
sudo apt install ./airbot_play_*.deb -y

The functional package for data collection should be in the format of airbot_aloha_<version>_<arch>.deb. Install this package with the following command:

sudo apt install ./airbot_aloha_*.deb

2.2 Data Replay Environment

Data Collection and Convertion Environment needs to be configured first.

This configuration will install the AIRBOT Play Python API.

The source code file name should be in the format of airbot_play_python_<version>.zip. Run the following command to extract and build the project:

sudo apt install librosconsole-dev liburdf-dev libspdlog-dev libfmt-dev git -y
cp airbot_play_python_2.7.zip airbot_play_python.zip
unzip airbot_play_python.zip -q && sudo rm -rf airbot_play_python.zip
cd airbot_play_python/python
git clone --depth 1 https://github.com/pybind/pybind11.git
mkdir build && cd build
cmake .. && make -j32 && cd ..

Note: If you manually download the pybind11 project, then you should extract and rename the folder to pybind11 and replace the empty pybind11 folder in airbot_play_python/python.

Finally, install the airbot python package via pip:

pip install . -i https://pypi.mirrors.ustc.edu.cn/simple/

3. Data Collection

Data Collection and Convertion Environment needs to be configured first.

The task name should be reasonable, and it is recommended to include time in the name to distinguish the same task data collected at different times.

3.1 Starting Robotic Arms

Prepare all teaching arms and execution robotic arms.
Connect the power sources of all robotic arms (order doesn't matter).
First, connect the teaching arm via Type-C data cable (corresponding to CAN0), then connect the execution arm, too (corresponding to CAN1). For dual-arm operations, follow the above sequence for the left-side robotic arm first, then the right-side arm.
Long-press the power button on each robotic arm to turn them on.
Ensure that the robotic arms are at the zero pose; otherwise, perform a zero calibration.

Note: Other devices connected to your computer may occupy the CAN interfaces, you may need to change the dufault can interfaces manually. Please refer to Explanation of Parameters.

3.2 Connecting Cameras

Data collection typically requires multiple cameras, and the connection order can be as follows:

Single-arm task sequence: Arm-mounted camera left eye -> Arm-mounted camera right eye -> Base-mounted camera
Dual-arm task: Left arm camera -> Right arm camera -> Environment camera top -> Environment camera bottom

The above sequence is just a reference: the actual connection order depends on the number of cameras and their placement. Generally, prioritize connecting arm-mounted, left-side, and overhead cameras first.

3.3 Starting Data Collection

airbot_demonstrate \
    -c <cam_device_0> -c <cam_device_1> -c <cam_device_2> \
    -mts <max_time_step> \
    -tn <task_name> \
    -se <start_episode_index> \
    -f 15 \
    -sjp <joint_pos_1> <joint_pos_2> <joint_pos_3> <joint_pos_4> <joint_pos_5> <joint_pos_6> <gripper_pos>

For dual-arm tasks, replace the command airbot_demonstrate with airbot_demonstrate_dual and change -sjp to -sjpl and -sjpr to specify the initial joint positions of left arm and right arm respectively (both defualt to 0 if not used).

3.3.1 Explanation of Parameters

-c: Device number of the USB cameras, the order of multiple device numbers (i.e., camera sequence) is specified by the specific task.
- Typically, follow the connection order, where available camera device numbers are incremented by even numbers, such as 0, 2, 4...
- If your computer has a built-in regular camera, it occupies one device number, and externally connected camera device numbers start from 2 and increment by even numbers: 2, 4, 6... (if cameras were connected before the computer started, the device number of the built-in camera may not be 0; in rare cases, after connecting cameras post-startup, the built-in camera's device number may not be 0).
- If your computer has a built-in depth/structured light camera (like Windows Hello camera), the RGB camera and depth camera each occupy a device number, and externally connected camera device numbers start from 4 and increment by even numbers: 4, 6, 8...
-mts: Specifies the maximum number of frames to be captured; specified by the specific task. When reaching the maximum time steps, the program will prompt.
-tn: Task name, specified by the specific task.
- The collected data will be saved in the demonstrations/(raw/)<task_name> folder in the current directory.
-se: Starting episode number for collection, default is 0.
- After interruption, you can modify this value to continue collection without overwriting previous data.
- When data is mistakenly saved, specifying this episode number allows for re-collection to overwrite existing data.
- When collecting data multiple times after executing the command, the episode numbers of subsequent collections will increment from this base.
- Each set of data is saved in the demonstrations/(raw/)<task_name>/<episode_id> folder.
-sjp: Initial positions of each joint and gripper before starting collection for each episode; defaults to 0 if not used. Specify values based on the actual situation of the specific task.
-f: Data collection frequency, default is 15Hz.

And there are some other parameters for flexible usage:

-m: the can interface of of the teacher arm, default to can0.
-n: the can interface of of the follower arm, default to can1.

3.3.2 Excution Example

One teacher with one follower:

airbot_demonstrate -c 0 -mts 100 -tn test_task -sjp 0.0 0.0 0.0 0.0 0.0 0.0 0.0

Two teachers with two followers:

airbot_demonstrate_dual -c 0 -mts 100 -tn test_task -sjpl 0.0 0.0 0.0 0.0 0.0 0.0 0.0 -sjpr 0.0 0.0 0.0 0.0 0.0 0.0 0.0

3.3.3 Key Descriptions

Do not press or hold keys continuously; otherwise, the key's behavior will repeat sequentially.

After excuting the command above, the terminal will be cleaned and you can use keyboard to control. The key descriptions are as follows:

g: Toggle gravity compensation on/off.
Spacebar: Start/stop episode data recording.
q: Discard the current record.
0: Return the robotic arm to the initial position.
p: Print current robotic arm states information. If the information is not displayed properly, increase your terminal width.

3.3.4 Operational Steps

Start the program, and the real-time windows of each camera will appear (if not all cameras start, try adjusting device numbers or checking camera connections; try connecting only one camera per docking station; some computers may support only 1-2 external cameras when USB ports share the same bus, consider changing the computer).
Press Spacebar to start recording data and simultaneously, teleoperate the robotic arm to complete the task.
After completing the task, wait to collect the specified number of frames (the number of frames used to complete the task should be as close as possible to the maximum collection frames):
- If the teaching opration is not acceptable, press q to discard the current teaching record, then press 0 to control the robotic arm to return to the initial position.
- If it is acceptable, press Spacebar to save the current teaching record. After saving, the robotic arm will automatically return to the initial position.
(Optional) In the demonstrations/(raw/)<task_name> folder in the current directory, check the recorded episodes.
- Each collected episode data includes: videos recorded by cameras (.avi), and a robotic arm status record file (.json).

Additional Notes:

Try to ensure that the completed task actions are completed just before reaching the maximum frame count, i.e., do not end the action too early.
The robotic arm movement speed should not be too fast; otherwise, the collected information will be sparse, and the image quality will not be high.
It is recommended to store the collected task data folder in the same directory structure on a portable hard drive as a backup.

4. Data Convertion

Data Collection and Convertion Environment needs to be configured first.

In the same path where the data collection command was executed, run the following command to save the data in hdf5 format:

/usr/bin/python3 -m airbot_aloha.convert_episodes -rn 1 -cn 0 -tn test_task -se 0 -ee 0 -rd ./demonstrations

Parameter explanation:

-cn: Specify camera names. For example, if there are 2 cameras, then specify -cn 0,1.
-tn: Specify the task name, same as specified during data collection.
-se: Specify the starting episode number of the data.
-ee: Specify the ending episode number of the data.
-rn: For dual-arm tasks, specify -rn 2.
-rd: Path to your data directory.

This command will search for the specified task data in the demonstrations(/raw) folder in the current directory and convert them to hdf5 files into the demonstrations/hdf5 folder.

Similarly, it is recommended to store the converted task data folder <task_name> in the same directory structure on a portable hard drive as a backup.

5. Data Replay (Optional)

Data replay can be used to verify if there are issues with collected data, init states of the environment, etc. (requires prior setup of the Data Replay Environment).

The data replay command and its parameters are as follows:

/usr/bin/python3 -m airbot_aloha.replay_episodes -rn 1 -tn test_task -ei 0 -ii -can 1 -cn 0

Parameter explanation:

-tn: Specify the task name.
-ei: Specify the ID corresponding to the HDF file.
-can: CAN ID of the follower arms. For dual-arm tasks, you can set -can 1 3 if you followed the connection order in Starting-Robotic-Arms. Also, you can unplug all the Type-C wires first and only connect the follower arms, then set -can 0 1.
-ii: Do not replay camera data.
-ia: Do not replay action data.
-rn: For dual-arm tasks, specify -rn 2.
-cn: The name of the cameras data to be replayed. For example, if there are 2 cameras, then specify -cn 0,1.