Astrology-Bot: Fine-tuned LLaMA-2-7B for Horoscope Chat and Tarot Reading enhanced with RAG

Motivation

Scientific testing has found no evidence to support the premises or purported effects of astrological traditions. Continued belief in astrology despite this lack of credibility is often taken as a sign of low scientific literacy, although some people believe in it even though they are scientifically literate. Let's make fun of it!

[Cover image: stargazing]

Repo Overview

Astrology-Bot/
│
├── data/                                 
│   ├── horoscope.csv 
│   ├── tarot.csv
│   ├── horoscope_webscraping.ipynb
│   └── tarot_webscraping.ipynb            
│
├── interface/                            
│   ├── get_response.py                   
│   ├── inference.py                      
│   └── UI.py                             
│
├── model/                                
│   ├── embedding_model.py                
│   └── inference_model.py               
│
└── RAG/
    ├── chunk_data.py                     
    ├── index_data.py                     
    ├── main.py                           
    └── utils.py                          


Dataset

Horoscope Reading

Scrape the plain text from Astrology.com on a daily basis.

For each of the twelve zodiac signs (Aries, Taurus, Gemini, Cancer, Leo, Virgo, Libra, Scorpio, Sagittarius, Capricorn, Aquarius, Pisces), I scraped the love, daily, and work horoscopes.
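
The actual scraping code lives in data/horoscope_webscraping.ipynb. As a rough illustration only, the sketch below pulls each page with requests and BeautifulSoup and writes the rows to data/horoscope.csv; the URL pattern and the paragraph selector are assumptions, not necessarily what the notebook uses.

import datetime
import requests
import pandas as pd
from bs4 import BeautifulSoup

SIGNS = ["aries", "taurus", "gemini", "cancer", "leo", "virgo", "libra",
         "scorpio", "sagittarius", "capricorn", "aquarius", "pisces"]
CATEGORIES = ["daily", "love", "work"]

def fetch_horoscope(sign: str, category: str) -> str:
    # NOTE: the URL pattern and the <p> selector are illustrative assumptions.
    url = f"https://www.astrology.com/horoscope/{category}/{sign}.html"
    resp = requests.get(url, timeout=10)
    resp.raise_for_status()
    soup = BeautifulSoup(resp.text, "html.parser")
    # Join all paragraph text into one plain-text horoscope.
    return " ".join(p.get_text(strip=True) for p in soup.find_all("p"))

rows = [
    {"date": datetime.date.today().isoformat(), "sign": s, "category": c,
     "text": fetch_horoscope(s, c)}
    for s in SIGNS for c in CATEGORIES
]
pd.DataFrame(rows).to_csv("data/horoscope.csv", index=False)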

Tarot

Scrape the plain text describing the meaning of each tarot card in its different positions from biddytarot.com.

Pipeline

[Pipeline diagram]

  1. Chunking: The cleaned text is chunked with a sliding window of 200 words and a step size of 50 words.
  2. Embed Text: The chunks are embedded with the BGE-Large model, which was selected from the MTEB Leaderboard.
  3. Index Embeddings: The embeddings are indexed into Pinecone. The retriever uses cosine similarity to pull the most relevant chunks from the database (steps 1-3 are sketched in the first code block after this list).
  4. Prompting: The query is embedded with the same encoder, and the retrieved text is added to the prompt.
  5. Inference: The LLaMA-2-7B model generates the response. Because of its autoregressive nature, the generated text is post-processed and only the first answer is extracted as the final output (steps 4-5 are sketched in the second code block after this list).
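
The chunking and indexing logic lives in RAG/chunk_data.py and RAG/index_data.py. The sketch below covers steps 1-3 under a few assumptions: the sentence-transformers release of BGE-Large (BAAI/bge-large-en-v1.5), the pinecone client v3 interface, and a placeholder input file and index name.

from sentence_transformers import SentenceTransformer
from pinecone import Pinecone

def chunk_words(text: str, window: int = 200, step: int = 50) -> list[str]:
    # Sliding window: 200-word chunks, advanced 50 words at a time.
    words = text.split()
    return [" ".join(words[i:i + window])
            for i in range(0, max(len(words) - window, 0) + 1, step)]

# 1. Chunk the cleaned corpus (file name is a placeholder).
corpus = open("data/horoscope_clean.txt").read()
chunks = chunk_words(corpus)

# 2. Embed each chunk with BGE-Large; normalized vectors make the dot
#    product equal to cosine similarity.
encoder = SentenceTransformer("BAAI/bge-large-en-v1.5")
vectors = encoder.encode(chunks, normalize_embeddings=True)

# 3. Upsert into a Pinecone index created with metric="cosine" and
#    dimension 1024 (BGE-Large's embedding size).
pc = Pinecone(api_key="YOUR_API_KEY")
index = pc.Index("astrology-bot")  # placeholder index name
index.upsert(vectors=[
    (f"chunk-{i}", vec.tolist(), {"text": chunk})
    for i, (chunk, vec) in enumerate(zip(chunks, vectors))
])

Storing the raw chunk text as metadata lets the retriever return it directly at query time without a second lookup.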
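
Steps 4-5 are spread across RAG/main.py, interface/get_response.py, and model/inference_model.py. The sketch below shows only the general retrieve-then-generate flow; the prompt template, generation settings, and the base meta-llama/Llama-2-7b-chat-hf checkpoint (standing in for the fine-tuned weights) are assumptions.

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from sentence_transformers import SentenceTransformer
from pinecone import Pinecone

encoder = SentenceTransformer("BAAI/bge-large-en-v1.5")
index = Pinecone(api_key="YOUR_API_KEY").Index("astrology-bot")  # placeholder

# 4. Embed the query with the same encoder and retrieve the closest chunks.
query = "What does the reversed Tower card mean for my career?"
q_vec = encoder.encode(query, normalize_embeddings=True).tolist()
matches = index.query(vector=q_vec, top_k=3, include_metadata=True).matches
context = "\n".join(m.metadata["text"] for m in matches)

# The retrieved text is prepended to the question (template is illustrative).
prompt = (
    "Use the following tarot knowledge to answer the question.\n"
    f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"
)

# 5. Generate with LLaMA-2-7B, then keep only the first answer, since the
#    autoregressive model tends to keep writing past it.
model_name = "meta-llama/Llama-2-7b-chat-hf"  # stand-in for the fine-tuned weights
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name, torch_dtype=torch.float16, device_map="auto"
)
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output_ids = model.generate(**inputs, max_new_tokens=256, do_sample=True, temperature=0.7)
completion = tokenizer.decode(
    output_ids[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True
)
answer = completion.split("Question:")[0].strip()  # crude post-processing: first answer only
print(answer)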

Results

Play around with it yourself! The app is deployed with Streamlit Community Cloud!

Horoscope

As shown in the examples below, the generated text is more readable and coherent after fine-tuning.

Before

[Screenshot: generated horoscope before fine-tuning]

After

[Screenshot: generated horoscope after fine-tuning]

Tarot

For the RAG system use case: [screenshots of generated tarot readings]
