Automatic (near-) duplicate content document detection in a cancer registry

Background: Duplicate and near-duplicate medical documents are problematic in document management, clinical use, and medical research. In this study, we focus on multisourced medical documents in the context of a population-based... ...

请注册登录后继续浏览