Ki-Cook: clustering multimodal cooking representations through knowledge-infused learning
Cross-modal recipe retrieval has gained prominence due to its ability to retrieve a text representation given an image representation and vice versa. Clustering these recipe representations based on similarity is essential to retrieve relevant information abou... ...