skip to Main Content

Since I’m moving away from pandas DataFrames to TensorFlow datasets, I’d like to use tensorflow-data-validation instead of the more traditional pandas-profiling when it comes to data exploration and validation.

However, pip install tensorflow-data-validation gives the following error:

ERROR: Could not find a version that satisfies the requirement tensorflow-data-validation (from versions: none)
ERROR: No matching distribution found for tensorflow-data-validation

What could be the problem? This old GitHub issue explains how this could be due to the Python version, but Apache Beam (on which tensorflow-data-validation presumably relies) is now fully compatible with Python 3, so it must be something else.

My environment is as follows:

  • Python 3.9.2
  • TensorFlow 2.6.0
  • Debian GNU/Linux 11 (bullseye)
  • pip 21.3

4

Answers


  1. Try this

    pip install --upgrade --force-reinstall tensorflow-data-validation[all]
    
    Login or Signup to reply.
  2. It might be a version compatibility issue with tensorflow==2.6.0.
    Try

    pip install tensorflow-data-validation==1.3.0
    
    Login or Signup to reply.
  3. I got the same error when using Python 3.9. After downgrading to Python 3.8, pip install tensorflow-data-validation ran successfully.

    Regarding your comment about Apache Beam, it looks like the Python SDK currently supports Python 3.8 (and earlier) but not yet Python 3.9.

    My environment:

    • Python 3.8.10
    • TensorFlow 2.8.0
    • macOS Monterey (12.0.1)
    • pip 21.1.1
    Login or Signup to reply.
  4. I was able to install the tensorflow_data_validation library successfully, via the below command in my Google Colab file.

    !pip install -U tensorflow 
                tensorflow-data-validation 
                apache-beam[gcp]
    
    Login or Signup to reply.
Please signup or login to give your own answer.
Back To Top
Search