Framework for processing and filtering datasets
-
Updated
Jun 2, 2024 - Python
Framework for processing and filtering datasets
Official implementation of our paper "Finetuned Multimodal Language Models are High-Quality Image-Text Data Filters".
Data Science Foundations I | Exploratory Data Analysis in Python | Summarizing Single Feature
Data Science Foundations I | Exploratory Data Analysis in Python | Inspect, Clean, and Validate a Dataset | EDA: Diagnosing Diabetes
This software anonymises data inside text files and CSV-like files. It removes various sorts of personally identifiable information. Each removed part is replaced with a suitable generic text, depending on the type of removed data. Currently English and Russian languages are supported. Russian works both with Cyrillic and Latin characters.
This sample illustrates how to create and register custom functions (Filter Control, Filter Editor).
PaveVibe: A georeferenced device leveraging ESP32, accelerometer, and GPS data to measure and map pavement quality, identifying defects for maintenance and infrastructure improvement.
2021 Java practice project focused on file reading and data processing. It includes functions for custom exception handling, data conversion into objects, and basic filtering of records based on specific criteria. A practice of Java fundamentals
DSIR large-scale data selection framework for language model training
Building Football Team Card
CDC Connect is a cross-platform mobile application built in React Native using JavaScript. The app is designed for data collection with a focus on surveys.
⏳ Provide filtering, sanitizing, and conversion of Golang data. 提供对Golang数据的过滤,净化,转换。
Process datasets, and then outputting information to the screen with Javascript.
This repository contains a Python script that allows you to filter data in an Excel file using Streamlit, a web application framework for Python. The script utilizes the pandas library for data manipulation.
An intuitive GUI-based Python application allowing a user to easily extract data from a file based on specific keywords to generate a focused output file.
This is a presentation that I did for R-Ladies Gaborone 😀
Weka Comparator to match rules to test data with filtering abilites
I have been tasked with conducting an exploratory data analysis (EDA) on a dataset provided by a client. My objective is to extract initial insights from the dataset that can be used for further analysis. This dataset contains information on several movies, including titles, descriptions, genres, durations, and more.
Filter & Fetch Dynamically Data
Highlight search results in an in-place RTF editor.
Add a description, image, and links to the data-filtering topic page so that developers can more easily learn about it.
To associate your repository with the data-filtering topic, visit your repo's landing page and select "manage topics."