In this study, the researchers developed a novel model for surgical action triplet recognition. This model consists of two key components: acapable of handling multiple tasks simultaneously in surgical video, and a loss function designed for multiple similar labels."Both spatial and temporal features within surgical videos are considered by our framework, a departure from previous methods that predominantly focused on spatial features alone," said Prof. Jia.
The proposed model outperformed existing methods, including Triplet, Attention Triplet, and Rendezvous approaches. Compared to the state-of-the-art Rendezvous method, the model achieved average precision improvements of 4.6%, 4.0%, and 7.8% in instrument, action, and organ recognition tasks, reaching 82.1%, 51.5%, and 45.5%, respectively. In the overall triplets' recognition task, the proposed model also improved by 3.1% in average precision, reaching 35.8%.
"In future work, we aim to enhance recognition accuracy based on the proposed model framework," said Prof. Jia.Yuchong Li et al, MT-FiST: A Multi-Task Fine-grained Spatial-Temporal Framework for Surgical Action Triplet Recognition,
Technology Technology Latest News, Technology Technology Headlines
Similar News:You can also read news stories similar to this one that we have collected from other news sources.
Source: medical_xpress - 🏆 101. / 51 Read more »