IEEE transactions on medical imaging. 2017 Jan;36(1):86-97. doi: 10.1109/TMI.2016.2593957

EndoNet: A Deep Architecture for Recognition Tasks on Laparoscopic Videos

内镜网络：腹腔镜视频识别任务的深度架构翻译改进

Andru P Twinanda, Sherif Shehata, Didier Mutter, Jacques Marescaux, Michel de Mathelin, Nicolas Padoy

作者单位 +展开

作者单位

DOI: 10.1109/TMI.2016.2593957 PMID: 27455522

Surgical workflow recognition has numerous potential medical applications, such as the automatic indexing of surgical video databases and the optimization of real-time operating room scheduling, among others. As a result, surgical phase recognition has been studied in the context of several kinds of surgeries, such as cataract, neurological, and laparoscopic surgeries. In the literature, two types of features are typically used to perform this task: visual features and tool usage signals. However, the used visual features are mostly handcrafted. Furthermore, the tool usage signals are usually collected via a manual annotation process or by using additional equipment. In this paper, we propose a novel method for phase recognition that uses a convolutional neural network (CNN) to automatically learn features from cholecystectomy videos and that relies uniquely on visual information. In previous studies, it has been shown that the tool usage signals can provide valuable information in performing the phase recognition task. Thus, we present a novel CNN architecture, called EndoNet, that is designed to carry out the phase recognition and tool presence detection tasks in a multi-task manner. To the best of our knowledge, this is the first work proposing to use a CNN for multiple recognition tasks on laparoscopic videos. Experimental comparisons to other methods show that EndoNet yields state-of-the-art results for both tasks.

Keywords：deep architecture; recognition tasks; laparoscopic videos

关键词：深度架构; 识别任务; 腹腔镜视频

相关内容

全文链接

官方链接

PMC全文

引文链接

复制

已复制！

格式：

EndoNet: A Deep Architecture for Recognition Tasks on Laparoscopic Videos

内镜网络：腹腔镜视频识别任务的深度架构翻译改进

RSDNet: Learning to Predict Remaining Surgery Duration from Laparoscopic Videos Without Manual Annotations

基于腹腔镜视频的剩余手术时间预测方法RSDNet：无需人工标注

Retrieval independence between parts and wholes in successive recognition tasks

连续识别任务中部分和整体之间的检索独立性

Dual-correlate optimized coarse-fine strategy for monocular laparoscopic videos feature matching via multilevel sequential coupling feature descriptor

通过多级顺序耦合特征描述符进行单目腹腔镜视频特征匹配的双相关优化粗细策略

PATG: position-aware temporal graph networks for surgical phase recognition on laparoscopic videos

基于位置感知时序图网络的腹腔镜手术相位识别技术

[Alexithymia and memory: a more rigorous criterion for acceptance of recognition tasks?]

[ alexithymia与记忆：识别任务接受更为严格的准则？]

Orthographic neighborhood effects in recognition and recall tasks in a transparent orthography

透明文字系统中字母邻近效应在识别和回忆任务中的作用

Bubbles: a technique to reveal the use of information in recognition tasks

气泡技术：揭示识别任务中信息使用情况的方法

EasyLabels: weak labels for scene segmentation in laparoscopic videos

易标签：腹腔镜视频场景分割的弱标注方法

EndoNet: A Deep Architecture for Recognition Tasks on Laparoscopic Videos

内镜网络：腹腔镜视频识别任务的深度架构 翻译改进

RSDNet: Learning to Predict Remaining Surgery Duration from Laparoscopic Videos Without Manual Annotations

基于腹腔镜视频的剩余手术时间预测方法RSDNet：无需人工标注

Retrieval independence between parts and wholes in successive recognition tasks

连续识别任务中部分和整体之间的检索独立性

Dual-correlate optimized coarse-fine strategy for monocular laparoscopic videos feature matching via multilevel sequential coupling feature descriptor

通过多级顺序耦合特征描述符进行单目腹腔镜视频特征匹配的双相关优化粗细策略

PATG: position-aware temporal graph networks for surgical phase recognition on laparoscopic videos

基于位置感知时序图网络的腹腔镜手术相位识别技术

[Alexithymia and memory: a more rigorous criterion for acceptance of recognition tasks?]

[ alexithymia与记忆：识别任务接受更为严格的准则？]

Orthographic neighborhood effects in recognition and recall tasks in a transparent orthography

透明文字系统中字母邻近效应在识别和回忆任务中的作用

Bubbles: a technique to reveal the use of information in recognition tasks

气泡技术：揭示识别任务中信息使用情况的方法

EasyLabels: weak labels for scene segmentation in laparoscopic videos

易标签：腹腔镜视频场景分割的弱标注方法

内镜网络：腹腔镜视频识别任务的深度架构翻译改进