Journal article
SF-GPT: A training-free method to enhance capabilities for knowledge graph construction in LLMs
Neurocomputing (Amsterdam), v 613, 128726
14 Jan 2025
Abstract
Knowledge graphs (KGs) are constructed by extracting knowledge triples from text and fusing knowledge, enhancing information retrieval efficiency. Current methods for knowledge triple extraction include ”Pretrain and Fine-tuning” and Large Language Models (LLMs). The former shifts effort from manual extraction to dataset annotation and suffers from performance degradation with different test and training set distributions. LLMs-based methods face errors and incompleteness in extraction. We introduce SF-GPT, a training-free method to address these issues. Firstly, we propose the Entity Extraction Filter (EEF) module to filter triple generation results, addressing evaluation and cleansing challenges. Secondly, we introduce a training-free Entity Alignment Module based on Entity Alias Generation (EAG), tackling semantic richness and interpretability issues in LLM-based knowledge fusion. Finally, our Self-Fusion Subgraph strategy uses multi-response self-fusion and a common entity list to filter triple results, reducing noise from LLMs’ multi-responses. In experiments, SF-GPT showed a 55.5% increase in recall and a 32.6% increase in F1 score on the BDNC dataset compared to the UniRel model trained on the NYT dataset and achieved a 5% improvement in F1 score compared to GPT-4+EEF baseline on the WebNLG dataset in the case of a fusion round of three. SF-GPT offers a promising way to extract knowledge from unstructured information.
Metrics
39 Record Views
Details
- Title
- SF-GPT: A training-free method to enhance capabilities for knowledge graph construction in LLMs
- Creators
- Lizhuang SunPeng ZhangFang GaoYuan AnZhixing LiYuanwei Zhao
- Publication Details
- Neurocomputing (Amsterdam), v 613, 128726
- Publisher
- Elsevier; AMSTERDAM
- Number of pages
- 19
- Resource Type
- Journal article
- Language
- English
- Academic Unit
- Information Science (Informatics)
- Web of Science ID
- WOS:001348564300001
- Scopus ID
- 2-s2.0-85207646223
- Other Identifier
- 991021931915604721
InCites Highlights
Data related to this publication, from InCites Benchmarking & Analytics tool:
- Web of Science research areas
- Computer Science, Artificial Intelligence