Since I’m moving away from pandas DataFrames to TensorFlow datasets, I’d like to use tensorflow-data-validation instead of the more traditional pandas-profiling for data exploration and validation.
However, pip install tensorflow-data-validation gives the following error:
ERROR: Could not find a version that satisfies the requirement tensorflow-data-validation (from versions: none)
ERROR: No matching distribution found for tensorflow-data-validation
What could be the problem? This old GitHub issue explains how this could be due to the Python version, but Apache Beam (on which tensorflow-data-validation presumably relies) is now fully compatible with Python 3, so I assume the cause is something else.
My environment is as follows:
- Python 3.9.2
- TensorFlow 2.6.0
- Debian GNU/Linux 11 (bullseye)
- pip 21.3
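For reference, "from versions: none" generally means pip found no release file that matches the running interpreter and platform, so those are the values worth reporting. A minimal sketch that collects them with the standard library only (the function name `describe_environment` is made up for illustration):

```python
import platform
import sys


def describe_environment():
    """Collect the interpreter and platform details relevant to wheel matching."""
    return {
        # Full interpreter version, e.g. "3.9.2"
        "python": "{}.{}.{}".format(*sys.version_info[:3]),
        # CPython, PyPy, ...
        "implementation": platform.python_implementation(),
        # Linux, Darwin, Windows, ...
        "system": platform.system(),
        # CPU architecture, e.g. x86_64 or aarch64
        "machine": platform.machine(),
    }


if __name__ == "__main__":
    for key, value in describe_environment().items():
        print(f"{key}: {value}")
```

Run it with the same interpreter that pip uses, otherwise the reported version may differ from the one pip resolves packages for.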
4 Answers
Try this
It might be a version compatibility issue with tensorflow==2.6.0.
Try
I got the same error when using Python 3.9. After downgrading to Python 3.8, pip install tensorflow-data-validation ran successfully.
Regarding your comment about Apache Beam, it looks like the Python SDK currently supports Python 3.8 (and earlier) but not yet Python 3.9.
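A quick way to check an interpreter before attempting the install is sketched below. The upper bound of 3.8 is an assumption taken only from the observation above (3.8 worked, 3.9 did not); it is not an official compatibility table, and the helper name `is_probably_supported` is made up for illustration.

```python
import sys

# Assumption: highest Python 3 minor version for which the install worked,
# based on the report above (3.8 succeeded, 3.9 failed). Check the package's
# release files for the authoritative list.
ASSUMED_MAX_MINOR = 8


def is_probably_supported(version_info, max_minor=ASSUMED_MAX_MINOR):
    """Return True if the given (major, minor, ...) version is within the assumed range."""
    major, minor = version_info[0], version_info[1]
    return major == 3 and minor <= max_minor


if __name__ == "__main__":
    if is_probably_supported(sys.version_info):
        print("This interpreter is within the assumed supported range.")
    else:
        print("This interpreter is outside the assumed range; try Python 3.8.")
```

With the environment from the question, `is_probably_supported((3, 9, 2))` is False, which is consistent with pip finding no matching distribution.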
My environment:
I was able to install the tensorflow_data_validation library successfully, via the below command in my Google Colab file.