On bridging consumer health search across languages using cross-lingual word space
Journal article   Peer reviewed

Chia-Hsuan Chang and Christopher C. Yang
Electronic commerce research and applications, v 59, 101254
May 2023

Abstract

Keywords: Consumer health search; Consumer health vocabulary; Cross-lingual information retrieval; Cross-lingual word space; Search engine
Information search is a lucrative activity for electronic commerce and contributes significant revenue. As search engines become the primary source of health information, search quality is a concern for consumers. Specifically, the health vocabulary used by general consumers differs from that of professionals, leading to unsatisfactory retrieval performance and misleading information. The situation becomes more complex when consumers search for health information across languages. To study consumer health search (CHS) across languages, we propose a cross-lingual retrieval framework comprising a semantic space construction module and an information retrieval module. The semantic space construction module adopts a weakly supervised approach to determine a cross-lingual word space (CLWS) from collected consumer-generated health content, which helps capture consumers’ vocabulary and identify translations of medical expressions. The information retrieval module suggests strategies that use the CLWS translations in the subsequent retrieval tasks. By evaluating performance on two health information search engines, we found that Google Translate (GT), a widely adopted translation service, is prone to mistranslating colloquial medical expressions. The CLWS helps identify GT’s mistranslations and filter out the results retrieved with them. More importantly, our framework demonstrates a strategy for integrating the translations of CLWS and GT to reach the best retrieval performance on the cross-lingual CHS task.

• We propose a retrieval framework for facilitating cross-lingual consumer health search using a cross-lingual word space.
• A weakly supervised cross-lingual word space helps translate health queries.
• Google Translate is prone to mistranslating colloquial health queries.
• Our proposed framework complements Google Translate to deliver better retrieval performance.
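As a rough illustration of the idea summarized above (not the authors' implementation), the sketch below shows how a shared cross-lingual word space could propose translations by nearest-neighbor search and flag a machine translation that lands far from the source term. Everything here is an assumption made for illustration: the toy vectors, the 0.5 threshold, the hypothetical machine-translation output "pull", and the function names `clws_translations`, `looks_mistranslated`, and `merged_query_terms`.

```python
import math

# Toy vectors in a hypothetical shared cross-lingual word space.
# All values are invented for illustration; none come from the paper.
SOURCE_VECTORS = {
    "拉肚子": [0.90, 0.10, 0.00],  # colloquial Chinese expression for diarrhea
}
TARGET_VECTORS = {
    "diarrhea": [0.88, 0.12, 0.02],
    "stomach": [0.30, 0.80, 0.10],
    "pull": [0.00, 0.10, 0.95],  # hypothetical literal machine-translation output
}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def clws_translations(term, k=2):
    """Return the k target-language words nearest to `term` in the shared space."""
    query = SOURCE_VECTORS[term]
    ranked = sorted(
        TARGET_VECTORS,
        key=lambda word: cosine(query, TARGET_VECTORS[word]),
        reverse=True,
    )
    return ranked[:k]


def looks_mistranslated(term, mt_output, threshold=0.5):
    """Flag a machine translation that is unknown or far from the source term."""
    if mt_output not in TARGET_VECTORS:
        return True
    return cosine(SOURCE_VECTORS[term], TARGET_VECTORS[mt_output]) < threshold


def merged_query_terms(term, mt_output, k=1):
    """Combine word-space translations with the machine translation if it passes the check."""
    terms = clws_translations(term, k)
    if not looks_mistranslated(term, mt_output) and mt_output not in terms:
        terms.append(mt_output)
    return terms


if __name__ == "__main__":
    print(clws_translations("拉肚子"))
    print(looks_mistranslated("拉肚子", "pull"))
    print(merged_query_terms("拉肚子", "pull"))
```

In this sketch the merged term list is what would be sent to the search engine; the paper's actual integration strategy and its weakly supervised construction of the word space are not reproduced here.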

Metrics

14 Record Views
2 citations in Scopus

Details

UN Sustainable Development Goals (SDGs)

This publication has contributed to the advancement of the following goals:

#3 Good Health and Well-Being

InCites Highlights

Data related to this publication, from InCites Benchmarking & Analytics tool:

Collaboration types
Domestic collaboration
International collaboration
Web of Science research areas
Business
Computer Science, Information Systems
Computer Science, Interdisciplinary Applications