Dynamic data testing engine based on PySpark
This is a dynamic, config-based framework for:
- Unit testing
- Data value/integrity testing
The framework checks that the data under test meets the configured conditions (data type, value range, etc.).
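As a rough illustration of a config-driven condition check, the sketch below validates column types and value ranges on a PySpark DataFrame. The rule format (`column`, `dtype`, `min`, `max`) and the helper names `violation_expr` and `check_frame` are assumptions made for this example, not the framework's actual config schema or API. It relies only on the standard `DataFrame.dtypes`, `DataFrame.filter` (with a SQL expression string) and `DataFrame.count` calls.

```python
# Hypothetical rule config: column name, expected Spark SQL type name,
# and an optional inclusive value range. Not the framework's real schema.
RULES = [
    {"column": "age", "dtype": "int", "min": 0, "max": 120},
    {"column": "name", "dtype": "string"},
]


def violation_expr(rule):
    """Build a Spark SQL predicate that is true for rows breaking the range."""
    col = f"`{rule['column']}`"
    parts = []
    if "min" in rule:
        parts.append(f"{col} < {rule['min']}")
    if "max" in rule:
        parts.append(f"{col} > {rule['max']}")
    # Empty string means the rule has no range condition.
    return " OR ".join(parts)


def check_frame(df, rules):
    """Return a list of human-readable failures for a PySpark DataFrame."""
    # DataFrame.dtypes is a list of (column, type) pairs, e.g. ("age", "int").
    actual_types = dict(df.dtypes)
    failures = []
    for rule in rules:
        name = rule["column"]
        if name not in actual_types:
            failures.append(f"{name}: column missing")
            continue
        if actual_types[name] != rule["dtype"]:
            failures.append(
                f"{name}: expected {rule['dtype']}, found {actual_types[name]}"
            )
        expr = violation_expr(rule)
        if expr:
            # Null values do not satisfy < or >, so they are not counted here.
            bad_rows = df.filter(expr).count()
            if bad_rows:
                failures.append(f"{name}: {bad_rows} row(s) out of range")
    return failures
```

In a Databricks notebook this might be called as `check_frame(spark.table("some_table"), RULES)`, where `some_table` is a placeholder; an empty list means every rule passed.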
It can also compare data from two sources:
- SQL Server
- CSV
- Parquet
- Blob storage
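A comparison of two sources could look roughly like the sketch below. The config keys (`type`, `path`, `url`, `table`, `user`, `password`) and the function names `load_source` and `compare_frames` are hypothetical and chosen only for illustration. The readers used are the standard `spark.read.csv`, `spark.read.parquet` and JDBC reader, and the diff uses `DataFrame.exceptAll` (available from Spark 2.4), which expects both DataFrames to have compatible schemas. Files in Blob storage are assumed to be reachable through a mounted or `wasbs://` path passed to the CSV or Parquet reader.

```python
def load_source(spark, cfg):
    """Load one source described by a (hypothetical) config dict."""
    kind = cfg["type"]
    if kind == "csv":
        return spark.read.csv(cfg["path"], header=True, inferSchema=True)
    if kind == "parquet":
        return spark.read.parquet(cfg["path"])
    if kind == "sqlserver":
        # Requires the SQL Server JDBC driver to be available on the cluster.
        return (
            spark.read.format("jdbc")
            .option("url", cfg["url"])
            .option("dbtable", cfg["table"])
            .option("user", cfg["user"])
            .option("password", cfg["password"])
            .load()
        )
    raise ValueError(f"Unsupported source type: {kind}")


def compare_frames(left, right):
    """Count rows present in one DataFrame but not the other.

    exceptAll keeps duplicates, so a row that appears twice on the left
    and once on the right is reported once as left-only.
    """
    return {
        "only_in_left": left.exceptAll(right).count(),
        "only_in_right": right.exceptAll(left).count(),
    }
```

Both counts being zero indicates that the two sources hold the same rows, ignoring row order.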
All code is written on Azure Databricks.