Hi there!
I am trying out TFX pipelines, and the first step is to ingest data. In this simple example I am ingesting data from a local (google drive) csv file with CsvExampleGen, following the book "Building Machine Learning Pipelines, publish by O’reilly. However, it has proven difficult to simply import the csv-file. I don’t get an error, but the artifact is empty.
Here is the code snipped:
from tfx.components import CsvExampleGen
os.chdir("/content/gdrive/MyDrive/TFXnotebooks")
base_dir = os.getcwd()
data_dir = os.path.join(base_dir, "data/")
context = InteractiveContext()
example_gen = CsvExampleGen(input_base= data_dir)
context.run(example_gen)
I’ve checked with os.listdir(data_dir)
that the file is indeed there.
Note: I know in many sources it says to import external_input
from tfx.utils.dsl_utils
, but this method is not supported any longer (the module doesn’t exist anymore).
Hope someone can help,
Thanks