OntoFact: Unveiling Fantastic Fact-Skeleton of LLMs via Ontology-Driven Reinforcement Learning

This is the source code for our AAAI 2024 paper "OntoFact: Unveiling Fantastic Fact-Skeleton of LLMs via Ontology-Driven Reinforcement Learning".

Quick Links

- Overview
- Usage
- Citation

Overview

We propose OntoFact, a novel adaptive framework for detecting facts unknown to LLMs, dedicated to mining the ontology-level skeleton of the missing knowledge. The following figure illustrates our method.

Usage

Getting Started

The structure of the folder is shown below:

 OntoFact
 ├─KG_embedding
 ├─LLMs_Factuality_Evaluation
 ├─Ontology-Driven_Reinforcement_Learning
 ├─Dataset_Process_Code
 ├─requirements.txt
 └─README.md
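
Before running any of the scripts, it can help to confirm that a checkout matches the layout above. The following is a minimal sketch, assuming the repository sits in a directory passed as `root`; the helper name `missing_entries` and the constant `EXPECTED_ENTRIES` are illustrative and not part of the repository.

```python
from pathlib import Path

# Entries copied from the folder structure listed above.
EXPECTED_ENTRIES = [
    "KG_embedding",
    "LLMs_Factuality_Evaluation",
    "Ontology-Driven_Reinforcement_Learning",
    "Dataset_Process_Code",
    "requirements.txt",
    "README.md",
]


def missing_entries(root):
    """Return the expected top-level entries that are absent under root."""
    root = Path(root)
    return [name for name in EXPECTED_ENTRIES if not (root / name).exists()]


if __name__ == "__main__":
    # Assumption: the checkout is a directory named "OntoFact" in the
    # current working directory; adjust the path as needed.
    absent = missing_entries("OntoFact")
    print("Missing:", absent if absent else "nothing")
```

An empty result means every listed folder and file was found.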

Introduction to the structure of the folder:

/KG_embedding: Source code for training knowledge graph embeddings (KGE) on the 5 datasets.
/LLMs_Factuality_Evaluation: Source code for the factuality evaluation of 32 LLMs on the 5 datasets using predefined prompt templates.
/Ontology-Driven_Reinforcement_Learning: Source code for ontology-driven reinforcement learning (ORL).
/Dataset_Process_Code: Source code for building the 5 benchmarks.
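
To make the idea of evaluating factuality with a prompt template concrete, here is a rough sketch of how a knowledge-graph triple could be turned into a true/false question and how a free-text answer could be scored. The template wording and the names `TRUE_FALSE_TEMPLATE`, `build_prompt`, and `is_correct` are hypothetical; the predefined templates in /LLMs_Factuality_Evaluation may look quite different.

```python
# Hypothetical template; the repository's own predefined templates may differ.
TRUE_FALSE_TEMPLATE = (
    "Is the following statement true or false? "
    "Statement: {head} {relation} {tail}. Answer with True or False."
)


def build_prompt(head, relation, tail, template=TRUE_FALSE_TEMPLATE):
    """Fill a prompt template with one knowledge-graph triple."""
    return template.format(head=head, relation=relation, tail=tail)


def is_correct(model_answer, expected=True):
    """Compare a free-text True/False answer against the expected label."""
    normalized = model_answer.strip().lower()
    if normalized.startswith("true"):
        return expected is True
    if normalized.startswith("false"):
        return expected is False
    # Answers that are neither "true" nor "false" count as incorrect here.
    return False


if __name__ == "__main__":
    print(build_prompt("Paris", "is the capital of", "France"))
```

Counting unparseable answers as incorrect is one possible design choice; an evaluation could equally track them as a separate category.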

Environment Installation

See requirements.txt

For training and limited evaluation

# python >= 3.9
# Basic PyTorch environment. If different LLMs require different versions, please substitute as appropriate.
pip install torch==1.13.1+cu117 torchvision==0.14.1+cu117 torchaudio==0.13.1 --extra-index-url https://download.pytorch.org/whl/cu117
# When python >= 3.10, please refer to [Link](https://github.com/facebookresearch/faiss/wiki/Installing-Faiss#compiling-the-python-interface-within-an-anaconda-install) to install faiss-gpu.
conda install -c pytorch faiss-gpu==1.7.3
pip install transformers tqdm
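
After installation, a quick check that the packages are visible to the interpreter can save a failed run later. This is a minimal sketch using only the standard library; the helper name `check_packages` is illustrative, and the module list assumes that faiss-gpu is imported under the name `faiss`.

```python
import importlib.util
import sys


def check_packages(names):
    """Map each module name to True if it can be located, without importing it."""
    return {name: importlib.util.find_spec(name) is not None for name in names}


if __name__ == "__main__":
    # The commands above ask for python >= 3.9.
    print("Python", sys.version.split()[0], "OK" if sys.version_info >= (3, 9) else "TOO OLD")
    # Module names for the packages installed by the commands above.
    modules = ["torch", "torchvision", "torchaudio", "faiss", "transformers", "tqdm"]
    for name, found in check_packages(modules).items():
        print(f"{name}: {'found' if found else 'MISSING'}")
```

This only confirms that each module can be located; it does not verify versions or CUDA availability.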

For evaluation

# -- Prepare/Train KG Embeddings --
# 1. Download all benchmarks (DBpedia, YAGO, CN-DBpedia, BIOS 2.2 (ENG), BIOS 2.2 (CHS)) from [Google Drive](https://drive.google.com/drive/folders/1vqPhgdISICLs-yPi6OTBg3Ik9D0YyGuk?usp=drive_link) to ./KG_embedding/data
# 2. Run the code with this in the shell:
cd ./KG_embedding
sh ./train.sh
# 3. Wait for the training to finish, or simply download the trained embedding files from [Google Drive](https://drive.google.com/drive/folders/1vqPhgdISICLs-yPi6OTBg3Ik9D0YyGuk?usp=drive_link) to ./KG_embedding/model.
# 4. Run the following in the shell (you will then obtain the embeddings of the instance and ontology graphs in the current directory):
cd ./KG_embedding
python ./KG_embedding/generate_embedding_npy.py
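
Since step 4 writes the embeddings to the current directory, one way to confirm that it produced output is to list the `.npy` files there. The sketch below uses only the standard library; the helper name `list_npy_files` is illustrative, and because the output file names are not stated here, it simply reports whatever `.npy` files it finds.

```python
from pathlib import Path


def list_npy_files(directory="."):
    """Return (file name, size in bytes) for every .npy file in directory."""
    return sorted((p.name, p.stat().st_size) for p in Path(directory).glob("*.npy"))


if __name__ == "__main__":
    files = list_npy_files(".")
    if not files:
        print("No .npy files found in the current directory.")
    for name, size in files:
        print(f"{name}: {size} bytes")
```

Each file found this way can then be opened with `numpy.load` to inspect its shape.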

Training & Evaluation

Training Scripts:

cd ./LLMs_Factuality_Evaluation
sh train.sh

ORL Evaluation Scripts:

# 1. Download all processed data from [Google Drive](https://drive.google.com/drive/folders/1vqPhgdISICLs-yPi6OTBg3Ik9D0YyGuk?usp=drive_link) to ./Ontology-Driven_Reinforcement_Learning/data
# 2. Run the code with this in the shell
cd ./Ontology-Driven_Reinforcement_Learning
sh train.sh

Q&A

NOTE: Due to time constraints, the submitted code has not been refactored and may contain bugs that we did not catch. These do not affect the results reported in our paper.

If you have any questions, please submit an issue or contact ziyus1999<AT>gmail.com or ziyus1999<AT>seu.edu.cn.

The datasets and detailed evaluation results are available on Google Drive.

Citation

If you find this method or code useful, please cite:

@inproceedings{shang2024ontofact,
  title={Ontofact: Unveiling fantastic fact-skeleton of llms via ontology-driven reinforcement learning},
  author={Shang, Ziyu and Ke, Wenjun and Xiu, Nana and Wang, Peng and Liu, Jiajun and Li, Yanhui and Luo, Zhizhao and Ji, Ke},
  booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
  volume={38},
  number={17},
  pages={18934--18943},
  year={2024}
}
