Skip to content

A Telegram bot which extract Text from PDF, also extract the Images of PDF Pages. Made with Python

License

Notifications You must be signed in to change notification settings

PrathameshDhande22/PdfTxtBot

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Telegram Bot for PDF Text Extraction and Image Conversion

GitHub GitHub last commit ptb-verison

This is a Telegram bot created with Python-telegram-bot v20.1 module that allows users to extract the text from a PDF document and convert PDFs to images.

Requirements :

  • python-telegram-bot
  • pdf2jpg
  • PyPDF
  • Python 3.x

Install Packages :

pip install -r requirements.txt

Usage :

/start : Starts the bot.

/help : Get the help description of the bot.

Just send the PDF to the bot, choose any one option Extract Text or Extract Images

Extract Text: It will extract the text from pdf and uploads the txt file from pdf.

Extract Images: It will convert the pdf to images and upload the images one by one.

How to cofigure the bot to Run :

Note : This bot is made to be run on a Local System.

Just run this command

Run the Bot:

python main.py

Output :

Author : Prathamesh Dhande