MOT-DETR : 3D single shot detection and tracking with transformers to build 3D representations for agro-food robots

MOT-DETR : 3D single shot detection and tracking with transformers to build 3D representations for agro-food robots

In the current demand for automation in the agro-food industry, accurately detecting and localizing relevant objects in 3D is essential for successful robotic operations. However, this is a challenge due the presence of occlusions. Multi-view perception approaches allow robots to overcome occlusions, but a tracking component is needed to associate the objects detected by the robot over multiple viewpoints. Most multi-object tracking (MOT) algorithms are designed for high frame rate sequences and struggle with the occlusions generated by robots’ motions and 3D environments. In this paper, we introduce MOT-DETR, a novel approach to detect and track objects in 3D over time using a combination of convolutional networks and transformers. Our method processes 2D and 3D data, and employs a transformer architecture to perform data fusion. We show that MOT-DETR outperforms state-of-the-art multi-object tracking methods. Furthermore, we prove that MOT-DETR can leverage 3D data to deal with long-term occlusions and large frame-to-frame distances better than state-of-the-art methods. Finally, we show how our method is resilient to camera pose noise that can affect the accuracy of point clouds. The implementation of MOT-DETR can be found here: https://github.com/drapado/mot-detr.

Saved in:

Bibliographic Details
Main Authors:	Rapado-Rincon, David, Nap, Henk, Smolenova, Katarina, van Henten, Eldert J., Kootstra, Gert
Format:	Article/Letter to editor biblioteca
Language:	English
Subjects:	Deep learning, Multi-object tracking, Robotics, Transformers,
Online Access:	https://research.wur.nl/en/publications/mot-detr-3d-single-shot-detection-and-tracking-with-transformers-
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Development and evaluation of automated localisation and reconstruction of all fruits on tomato plants in a greenhouse based on multi-view perception and 3D multi-object tracking
by: Rapado-Rincón, David, et al.

Robust node detection and tracking in fruit-vegetable crops using deep learning and multi-view imaging
by: Boogaard, Frans P., et al.

MinkSORT : A 3D deep feature extractor using sparse convolutions to improve 3D multi-object tracking in greenhouse tomato plants
by: Rapado-Rincón, David, et al.

ChickTrack - A Quantitative Tracking Tool for Measuring Chicken Activity
by: Neethirajan, S.R.

Automatic discard registration in cluttered environments using deep learning and object tracking: class imbalance, occlusion, and a comparison to human review
by: van Essen, Rick, et al.

Data underlying the publication: Automatic discard registration in cluttered environments using deep learning and object tracking: class imbalance, occlusion, and a comparison to human review
by: van Essen, Rick, et al.

Dataset of UAV thermal video sequences with annotations for MOTS benchmarking
by: Bárbulo Barrios, Diego, et al.

Quantitatively scoring behavior from video-recorded, long-lasting fish trajectories
by: Martí-Puig, Pere, et al.
Published: (2018-08)

APPLE MOTS: Detection, Segmentation and Tracking of Homogeneous Objects Using MOTS
by: de Jong, Stefan, et al.

Enhanced camera-based individual pig detection and tracking for smart pig farms
by: Guo, Qinghua, et al.

Passive radio frequency identification and video tracking for the determination of location and movement of broilers
by: Doornweerd, J.E., et al.

Application-specific evaluation of a weed-detection algorithm for plant-specific spraying
by: Ruigrok, Thijs, et al.

Monitoring mammalian herbivores via convolutional neural networks implemented on thermal UAV imagery
by: Bárbulo Barrios, Diego, et al.

Novel multi-omics deconfounding variational autoencoders can obtain meaningful disease subtyping
by: Li, Zuqi, et al.

Automatic phenotyping of tomatoes in production greenhouses using robotics and computer vision : From theory to practice
by: Fonteijn, Hubert, et al.

Deep learning-based multi-task prediction system for plant disease and species detection
by: Keceli, Ali Seydi, et al.

Automated Tracking Systems for the Assessment of Farmed Poultry
by: Neethirajan, Suresh

Joint 2D to 3D image registration workflow for comparing multiple slice photographs and CT scans of apple fruit with internal disorders
by: Schut, Dirk Elias, et al.

A deep learning framework for matching of SAR and optical imagery
by: Hughes, Lloyd Haydn, et al.

DNNGP, a deep neural network-based method for genomic prediction using multi-omics data in plants
by: Wang, K., et al.
Published: (2023)

Automated identification and counting of predated Ephestia kuehniella (Zeller) eggs using deep learning image analysis
by: Mouratidis, Angelos, et al.

Boosting plant-part segmentation of cucumber plants by enriching incomplete 3D point clouds with spectral data
by: Boogaard, Frans P., et al.

Deep learning methods improve genomic prediction of wheat breeding
by: Montesinos-Lopez, A., et al.
Published: (2024)

Beyond multicultural ‘tolerance’: guided tours and guidebooks as transformative tools for civic learning
by: Ormond, M.E., et al.

Cognitive Computing Advancements: Improving Precision Crop Protection through UAV Imagery for Targeted Weed Monitoring
by: Mesías-Ruiz, Gustavo A., et al.
Published: (2024-08-18)

Boosting precision crop protection towards agriculture 5.0 via machine learning and emerging technologies: A contextual review
by: Mesías-Ruiz, Gustavo A., et al.
Published: (2023-03-22)

Boosting precision crop protection towards agriculture 5.0 via machine learning and emerging technologies: A contextual review
by: Mesías-Ruiz, Gustavo A., et al.
Published: (2023-03-22)

Editorial: Synthetic data for computer vision in agriculture
by: Afonso, Manya, et al.

Multi-temporal land cover classification with sequential recurrent encoders
by: Rußwurm, Marc, et al.

Multitemporal Very High Resolution From Space: Outcome of the 2016 IEEE GRSS Data Fusion Contest
by: Mou, L., et al.

Fleets of robots for environmentally-safe pest control in agriculture
by: González-de-Santos, Pablo, et al.
Published: (2017-08)

Towards detecting floating objects on a global scale with learned spatial features using sentinel 2
by: Mifdal, Jamila, et al.

High-Throughput Plot-Level Quantitative Phenotyping Using Convolutional Neural Networks on Very High-Resolution Satellite Images
by: Victor, Brandon, et al.

TransfQMix: Transformers for Leveraging the Graph Structure of Multi-Agent Reinforcement Learning Problems
by: Gallici, Matteo, et al.
Published: (2023-05)

Automated River Plastic Monitoring Using Deep Learning and Cameras
by: van Lieshout, Colin, et al.

OPTIMA - RGB colour images and multispectral images (including LabelImg annotations)
by: Blok, Pieter M., et al.

New deep learning genomic prediction model for multi-traits with mixed binary, ordinal, and continuous phenotypes
by: Montesinos-López, Osval A., et al.
Published: (2018)

Underwater Multi-Target Tracking with Particle Filters
by: Masmitja, Ivan, et al.
Published: (2018-05)

DeepSTARia: enabling autonomous, targeted observations of ocean life in the deep sea
by: Barnard, Kevin, et al.
Published: (2024-04)

Detección de objetos en imágenes mediante aprendizaje sin ejemplos
by: Urquiza Toledo, Agustín Horacio
Published: (2021)

Resource Map