LAMPAT: Low-rank Adaptation Multilingual Paraphrasing using Adversarial Training

About The Project

This is an implementation of LAMPAT: Low-rank Adaptation Multilingual Paraphrasing using Adversarial Training.

LAMPAT has been accepted at the 38th AAAI Conference on Artificial Intelligence (AAAI-24). Paper can be found at this link.

Getting Started

To get started, you should have prior knowledge on Python and PyTorch at first. A few resources to get you started if this is your first Python or PyTorch project:

Installation

Clone the repo

git clone https://github.com/phkhanhtrinh23/LAMPAT.git

Use any code editor to open the folder LAMPAT.

Run

Create conda virtual environment: conda create -n lampat python=3.8, activate it: conda activate lampat, and install the required packages: pip install -r requirements.txt.
Download wmt19_v18
Extract the files to .txt files, rename all of the files with their ISO 639-1 code, and place them in the path data/wmt19_v18. For example: data/wmt19_v18/en.txt
Read and run train.sh to train the LAMPAT model.

Evaluation

Evaluation dataset

The evaluation dataset can be downloaded at this link

Download the zip file and unzip it to put into the evaluation/eval_dataset

Run

In the evaluation folder, there are 3 python files:

mev_sup_multi_ref.py: used to evaluate on STAPLE multi-reference evaluation dataset
mev_sup.py: used to evaluate on PAWS-X and Opusparcus
mev_unsup.py: used to evaluate on WMT19

Each file will run the metrics and report the score to the console

Contribution

Contributions are what make GitHub such an amazing place to be learn, inspire, and create. Any contributions you make are greatly appreciated.

Fork the project
Create your Contribute branch: git checkout -b contribute/Contribute
Commit your changes: git commit -m 'add your messages'
Push to the branch: git push origin contribute/Contribute
Open a pull request

Contact

Email: phkhanhtrinh23@gmail.com

Project Link: https://github.com/phkhanhtrinh23/LAMPAT.git

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
data		data
evaluation		evaluation
image		image
.gitignore		.gitignore
README.md		README.md
data_collator.py		data_collator.py
evaluate.py		evaluate.py
model.py		model.py
preprocess.py		preprocess.py
requirements.txt		requirements.txt
test.py		test.py
train.sh		train.sh
train_adv.py		train_adv.py

phkhanhtrinh23/LAMPAT

Folders and files

Latest commit

History

Repository files navigation

LAMPAT: Low-rank Adaptation Multilingual Paraphrasing using Adversarial Training

About The Project

Getting Started

Installation

Run

Evaluation

Evaluation dataset

Run

Contribution

Contact

About

Resources

Stars

Watchers

Forks

Languages