This repository contains the implementation of our paper "Auto-labelling of Bug Report using Natural Language Processing".
Duplicate bug report detection in tracking systems saves debugging efforts. Traditional solutions lack clear ranking,
deterring their use. Our paper introduces an NLP-based method using bug report attributes, leveraging a neural
network for retrieval.
@INPROCEEDINGS{10126470,
author={Patil, Avinash and Jadon, Aryan},
booktitle={2023 IEEE 8th International Conference for Convergence in Technology (I2CT)},
title={Auto-labelling of Bug Report using Natural Language Processing},
year={2023},
volume={},
number={},
pages={1-7},
doi={10.1109/I2CT57861.2023.10126470}}
- Make sure you have Python and pip installed.
Steps:
-
Clone the repository (if you haven't already):
git clone https://github.com/aryan-jadon/DuplicateBugDetection.git cd DuplicateBugDetection
-
(Optional but recommended) Set up a virtual environment:
python -m venv venv source venv/bin/activate # On Windows, use `venv\Scripts\activate`
-
Install the required packages:
pip install -r requirements.txt
After following these steps, all necessary libraries should be installed, and you're ready to execute the project.
-
This is the first script you'll need to run. Its main purpose is to create a mapping of bugs, which can be utilized in subsequent scripts.
python create_bugs_map.py
-
After generating the bugs map, the next step is to split the data into training and testing datasets.
python create_train_test.py
-
Once you have the training and testing data ready, this is the final script you will run which might contain the main algorithm or process of the project.
python main_file.py