Command Palette
Search for a command to run...
Recognition of Instrument-Tissue Interactions in Endoscopic Videos via Action Triplets
Nwoye Chinedu Innocent ; Gonzalez Cristians ; Yu Tong ; Mascagni Pietro ; Mutter Didier ; Marescaux Jacques ; Padoy Nicolas

Abstract
Recognition of surgical activity is an essential component to developcontext-aware decision support for the operating room. In this work, we tacklethe recognition of fine-grained activities, modeled as action triplets representing the tool activity. To this end, weintroduce a new laparoscopic dataset, CholecT40, consisting of 40 videos fromthe public dataset Cholec80 in which all frames have been annotated using 128triplet classes. Furthermore, we present an approach to recognize thesetriplets directly from the video data. It relies on a module called ClassActivation Guide (CAG), which uses the instrument activation maps to guide theverb and target recognition. To model the recognition of multiple triplets inthe same frame, we also propose a trainable 3D Interaction Space, whichcaptures the associations between the triplet components. Finally, wedemonstrate the significance of these contributions via several ablationstudies and comparisons to baselines on CholecT40.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| action-triplet-recognition-on-cholect40 | Tripnet | mAP: 18.95 |
| action-triplet-recognition-on-cholect50 | Tripnet (TensorFlow v1) | Mean AP: 20.0 |
| action-triplet-recognition-on-cholect50-1 | Tripnet (TensorFlow v1) | mAP: 23.4 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.