Demo: Table Pick and Place

Description

Pick and place is a common task in manufacturing and production lines. In this demo, the camera identifies objects and recognizes their bounding boxes, and the robot's movements are planned accordingly.

 

The objective of this demo is to program the robot to pick parts randomly placed on a table, after preparing the training data and training a detection model.

 

This demo shows how to identify an object and determine its location and orientation.

 

Step by step

We first need to manually prepare the training data, using approximately 60 images of an object. In each image, the object's bounding box must be annotated.

The images must then be split into training and validation sets.
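
For illustration, here is a minimal Python sketch of the 80/20 split. The source path follows the capture step described below; the output folder names are hypothetical, so adjust them to your setup.

    import random
    import shutil
    from pathlib import Path

    # Capture folder from the step below; output folders are hypothetical names.
    src = Path("pandai_ark/ros/data/images")
    train_dir = Path("train_images")
    val_dir = Path("validation_images")
    train_dir.mkdir(exist_ok=True)
    val_dir.mkdir(exist_ok=True)

    images = sorted(src.glob("*.png"))
    random.seed(0)                    # make the split reproducible
    random.shuffle(images)
    split = int(len(images) * 0.8)    # 80% training, 20% validation

    for img in images[:split]:
        shutil.copy(img, train_dir / img.name)
    for img in images[split:]:
        shutil.copy(img, val_dir / img.name)
    print(f"{split} training images, {len(images) - split} validation images")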

The following steps are explained in subsequent sections:

  • Capturing the images

  • Preparing the training set

  • Preparing the validation set

  • Training the model

  • Testing results

 

To capture the images
  1. Record images using the AI Accelerator Dashboard. Every time you tap Save Image, the current camera view is saved as a PNG image in the pandai_ark/ros/data/images folder on the compute module.
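
Since the demo needs roughly 60 images, it can help to check the capture count as you go. A minimal Python sketch, using the folder path above:

    from pathlib import Path

    # Image folder from the capture step above; adjust if your path differs.
    images = sorted(Path("pandai_ark/ros/data/images").glob("*.png"))
    print(f"Captured {len(images)} images")  # aim for roughly 60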

To prepare the training set
  1. Open the web browser on the compute module and navigate to MakeSense.ai.

    This publicly available website helps you label and annotate images.

  2. Click Get Started and upload 80% of your images.

  3. When the upload is complete, click Object Detection.

  4. Specify labels for the objects you want recognized: click the labels list and define at least one label.

    Name the new label object_1.

    You can define multiple labels and annotate multiple objects in a single image, as well as multiple instances of the same object.

  5. Click Start project.

  6. Specify a bounding box for each object in each image.

    Use the polygon tool and click points to map the bounding box.

  7. Select the object label for the new polygon.

  8. Once you have annotated all the images in the training set, click Actions and Export Annotations.

  9. Select Single file in COCO JSON format and click Export.

  10. Save the exported file to Downloads.

  11. Inside /ros/data/datasets, create a folder named rtdetr_active.

  12. Inside it, create a train folder and copy the training-set images and the downloaded COCO JSON file into it. Rename the JSON file to coco_train.json.

The training data consists of the JSON annotation file together with the images.
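
Before training, it can help to sanity-check the exported annotations. A minimal Python sketch, assuming the standard COCO JSON layout and the dataset path from the steps above:

    import json
    from pathlib import Path

    # Dataset path follows the steps above; adjust if your layout differs.
    with open(Path("/ros/data/datasets/rtdetr_active/train/coco_train.json")) as f:
        coco = json.load(f)

    print(f"{len(coco['images'])} images, "
          f"{len(coco['annotations'])} annotations, "
          f"{len(coco['categories'])} labels")

    # Warn about images that ended up without any annotation.
    annotated = {a["image_id"] for a in coco["annotations"]}
    for img in coco["images"]:
        if img["id"] not in annotated:
            print("No annotation for", img["file_name"])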

 

To prepare the validation set
  1. Repeat the above steps to create the validation set, starting by uploading the remaining 20% of the images to MakeSense.ai.

  2. Make sure to use the same label as in the training set.

  3. Create a validation folder inside rtdetr_active.

  4. Rename the downloaded COCO JSON file to coco_validation.json and copy it, together with the validation-set images, into the validation folder.
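
After both sets are prepared, the rtdetr_active folder should look as follows (layout assembled from the steps above):

    /ros/data/datasets/rtdetr_active/
    ├── train/
    │   ├── coco_train.json
    │   └── *.png            (80% of the images)
    └── validation/
        ├── coco_validation.json
        └── *.png            (20% of the images)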

 

To train the model
  1. Train the model using the GUI (you can restart it from run_ark.sh).

  2. Click Train model in the rtdetr_active column of the GUI.

Training can take from several minutes upward, depending on the total number of images and the number of labeled objects in each image.

To test the model
  1. Run run_ark.sh on the compute module.

  2. Load the model in rtdetr_active column of the GUI.

  3. Open the AI Accelerator Dashboard on the robot.

  4. Place an object within camera view.

  5. Select the detection2d view from the drop-down.

  6. Tap Detect in the right column.

  7. If an object is recognized, a green bounding box appears in the camera view.

 

Example of using object location

The example program stores a specific robot position, detect_wp. Before executing the program, check that the robot can move freely to detect_wp and that the motion poses no risk.

  1. Set the program speed to 10%.
  2. Select detect_wp and tap Move Here.
  3. Verify that there are no obstructions.
  4. If necessary, freedrive the robot to a new position and save it as detect_wp.

Included with the AI Accelerator SDK, you can find an example robot program that uses the recognition results.

  1. On the robot, open the ark_example_detect program, installed during setup.

  2. Place objects within camera view.

  3. Run the program.

  4. The robot moves to the detect_wp waypoint and captures an image.

    The function ark_detection_retrieve() returns the robot pose matching the bounding box of the recognized object.

  5. If a trained object is recognized, the robot moves to a position 150 mm above it. If multiple objects are recognized, the robot randomly chooses one.
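
For illustration only, here is a minimal Python sketch of the selection and offset logic, not the actual robot program: detections is a hypothetical stand-in for the poses returned by ark_detection_retrieve(), assumed here to be (x, y, z, rx, ry, rz) in meters and radians in the robot base frame.

    import random

    # Hypothetical stand-in for the poses returned by ark_detection_retrieve():
    # (x, y, z, rx, ry, rz) in meters / radians, in the robot base frame.
    detections = [
        (0.45, -0.10, 0.02, 0.0, 3.14, 0.0),
        (0.38,  0.05, 0.02, 0.0, 3.14, 0.0),
    ]

    if detections:
        target = random.choice(detections)        # pick one object at random
        x, y, z, rx, ry, rz = target
        approach = (x, y, z + 0.150, rx, ry, rz)  # 150 mm above the object
        print("Move to approach pose:", approach)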