Skip to content
#

datavalidation

Here are 28 public repositories matching this topic...

data-observability-installer

Installer for DataKitchen's Open Source Data Observability Products. Data breaks. Servers break. Your toolchain breaks. Ensure your team is the first to know and the first to solve with visibility across and down your data estate. Save time with simple, fast data quality test generation and execution. Trust your data, tools, and systems end to end.

  • Updated May 20, 2024
  • Python
dataops-testgen

DataOps TestGen is part of DataKitchen's Open Source Data Observability. DataOps TestGen delivers simple, fast data quality test generation and execution by data profiling,  new dataset screening and hygiene review, algorithmic generation of data quality validation tests, ongoing testing of new data refreshes, & continuous data anomaly monitoring

  • Updated May 20, 2024
  • Python

CSV Data Validator is a tool to validate csv file. It parse csv and validate the data with .hdr(csv meta data) before ingestion to Data Lake. It checks data file availability for every day load and validate data with respective meta data like File Size, Checksum, Delimiter, Record count etc. It ensure landed data conformity before give go ahead …

  • Updated Jan 6, 2019
  • Java

The main purpose of this repository is to build the pipeline for training of regression models and predict the compressive strength of concrete to reduce the risk and cost involved in discarding the concrete structures when the concrete cube test fails.

  • Updated Feb 27, 2023
  • Python

Improve this page

Add a description, image, and links to the datavalidation topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the datavalidation topic, visit your repo's landing page and select "manage topics."

Learn more