tokyo, a REST API, when given any type of document 📄, Identifies mime-type 🧐. Suggests extension 🦔. Alas Extracts text 💪.
-
Updated
Jun 13, 2020 - Clojure
tokyo, a REST API, when given any type of document 📄, Identifies mime-type 🧐. Suggests extension 🦔. Alas Extracts text 💪.
Heuristic based boilerplate removal tool
Python code to extract words and in turn extract letters using pytesseract
Extract all the texts of any project with HTML files and generate a KV (Key-Value) file, key = reference key, value = extracted text.
Arachnio client library for Java 11+
Retrieve data from two different websites, loading them into the PostgreSQL database using Python, and combine them to get and present new information
Tesseract-OCR quick implementation. Linked with stack-overflow question
Extract text content from an HTML page, process it, and extract unique words from the processed text. This notebook utilizes various text processing techniques including cleaning, normalization, tokenization, lemmatization or stemming, and stop words removal.
Api to get text from multiple types of files
A simple web application built with React which allows to upload images containing text, select the language of the text for recognition, and extract the text from the image. As quick as a finger snap - SnapText.
[Thesis] Video Text Extraction
Engine for automated the process of scraping PDFs into local and convert those PDFs into text by performing OCR.
PyQt5를 사용한 간단한 도서 스캐너 프로젝트 (바코드 인식과 텍스트 추출을 통한 도서 정보를 검색 및 표시)
custom github action to parse issue body
Harnesses the power of OpenAI's to revolutionize the way you consume information. Say goodbye to information overload and hello to quick and comprehensive understanding. Let our AI-Powered Content Summarizer extract the key insights from any text, allowing you to focus on what matters most.
AI solution that analyses thousands of typewritten documents in order to solve forced disappearances in Mexico.
The Business Card Reader is a Python application that utilizes computer vision techniques and optical character recognition (OCR) to extract text information from business cards. It provides an intuitive interface to capture an image of the business card, process it, and extract the text for further use.
License plate localizer using pre-trained YOLOv5, combined with text extraction using pre-trained TrOCR
Time Magazine Scraper, Text Extraction (OCR), and Data Exploration with Topic Modelling
2023년 11월 대한산업공학회(UNIST) - Developing data-driven QFD: A systematic approach to employing text information using product manuals, 2저자
Add a description, image, and links to the text-extraction topic page so that developers can more easily learn about it.
To associate your repository with the text-extraction topic, visit your repo's landing page and select "manage topics."