Journal article
On bridging consumer health search across languages using cross-lingual word space
Electronic commerce research and applications, v 59, 101254
May 2023
Featured in Collection : UN Sustainable Development Goals @ Drexel
Abstract
Information search is a lucrative activity for electronic commerce, contributing to significant revenues. As search engine becomes the primary source for seeking health information, the search quality concerns consumers. Specifically, the health vocabulary used by general consumers differs from the professionals, leading to unsatisfied retrieval performance and misleading information. The situation is increasingly complex when consumers search health information across languages. To study consumer health search (CHS) across languages, we propose a cross-lingual retrieval framework comprising semantic space construction and information retrieval modules. The semantic space construction module adopts a weakly-supervised approach to determine a cross-lingual word space (CLWS) from collected consumer-generated health content, which helps capture consumers’ vocabulary and identify medical expression translations. The information retrieval module suggests strategies that utilize the translations of CLWS for the subsequent retrieval tasks. By evaluating the performance on two health information search engines, we found that Google Translate (GT), a widely-adopted translation service, is prone to generate mistranslations to the colloquial medical expressions. The CLWS helps identify GT’s mistranslations and filter out the results retrieved by those. More importantly, our framework demonstrates a strategy to integrate translations of CLWS and GT to reach the best retrieval performance for the cross-lingual CHS task.
•We propose a retrieval framework for facilitating cross-lingual consumer health search using cross-lingual word space.•A weakly supervised cross-lingual word space helps translate health queries.•Google Translate is prone to generate mistranslations to the colloquial health queries.•Our proposed framework complements Google Translate to deliver better retrieval performance.
Metrics
Details
- Title
- On bridging consumer health search across languages using cross-lingual word space
- Creators
- Chia-Hsuan Chang - Drexel University, Information ScienceChristopher C. Yang - Drexel University, Information Science
- Publication Details
- Electronic commerce research and applications, v 59, 101254
- Publisher
- Elsevier
- Resource Type
- Journal article
- Language
- English
- Academic Unit
- Information Science
- Web of Science ID
- WOS:000959491000001
- Scopus ID
- 2-s2.0-85150457253
- Other Identifier
- 991021853710804721
UN Sustainable Development Goals (SDGs)
This publication has contributed to the advancement of the following goals:
InCites Highlights
Data related to this publication, from InCites Benchmarking & Analytics tool:
- Collaboration types
- Domestic collaboration
- International collaboration
- Web of Science research areas
- Business
- Computer Science, Information Systems
- Computer Science, Interdisciplinary Applications