Training TFDF with TFX

armandsauzay · January 18, 2022, 3:47pm

Hi all,

I am trying to set up a GB classifier on TFX for the first time. I am able to run smoothly with the interactive context. However when trying to run it on Kubeflow on GCP, I am having an issue with the transform component. Here are a few questions, which could hopefully solve the problems I am encountering:

What is the recommended environment and/or VM to set up TFDF with TFX?
When creating the pipeline on Kubeflow (with CLI: tfx pipeline create […]), wheels are being created for the transform and trainer component in a /tmp folder in (such as tfx_user_code_Transform-[…]). What purpose are those wheels serving and how to indicate where to store/retrieve them? I dont exactly know why and my set up might be wrong but the Transform component, when ran on Kubeflow, is looking for those wheels in the wrong location ( ings://[…]/[…]/_wheels/[…]). More context is given here
Is there any code snippet or architecture code that we could follow to set up a TFDF model on TFX. Or a estimator.GradientBoostedClassifier that I can’t seem to be able to set up properly.

Any help would be greatly appreciated
Armand

robertcrowe · January 19, 2022, 7:31pm

We have an example of TFDF here:

github.com

tensorflow/tfx/blob/master/tfx/examples/penguin/penguin_pipeline_local.py

# Copyright 2020 Google LLC. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
#     http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
"""Penguin example using TFX."""

import datetime
import multiprocessing
import os
import socket
import sys

This file has been truncated. show original

Ed_Park · January 19, 2022, 7:51pm

Hi @Robert_Crowe thanks for posting this. Lines 34- 36 are particularly interesting:

flags.DEFINE_enum('model_framework', 'keras',
                  ['keras', 'flax_experimental', 'tfdf_experimental'],
                  'The modeling framework.')

I’m not familiar with how flags influence TFX - does specifying tfdf_experimental signal something special to either Google AI platform or the VertexAI platform when the pipeline is executed there?

robertcrowe · January 19, 2022, 8:34pm

Nope, it’s just command line flags. If you search through the code you can see how it influences the name of the pipeline and which module file is selected.

armandsauzay · January 21, 2022, 7:00pm

Hi @Robert_Crowe, thanks for your answer.

A couple of follow up questions:

Are you working on a Linux machine to be able to locally develop tfdf models on tfx?
Why is this called tfdf_experimental? I saw that it was difficult to integrate such a model with TFS. Is it still the case?

Thanks again!

armandsauzay · February 17, 2022, 5:44pm

@Robert_Crowe just bumping up this in case you have some information on this. Is there any example of using TFDF on a Kubeflow pipeline? Thanks a lot!

Ed_Park · February 17, 2022, 6:54pm

@armandsauzay - I haven’t tried building the TFDF model introduced in TFX 1.6.0 to the Penguin example (tfx/tfx/examples/penguin at master · tensorflow/tfx · GitHub) to Kubeflow but it worked locally for me:

python penguin_pipeline_local.py --model_framework=tfdf_experimental

Presumably, switching to using penguin_pipeline_kubeflow.py should build the TFDF model to Kubeflow.

Topic		Replies	Views
TensorFlow Decision Forests with TFX (model serving and evaluation) General Discussion models , tfx , tfdf , help_request	34	7167	April 8, 2022
Kernel crashed using tfx and tensorflow-macos on intel mac General Discussion tfx , install , mac_os , gpu , help_request	7	1558	August 24, 2023
TFX custom component pipeline error: AttributeError: module '__main__' has no attribute 'ImageExampleGenExecutor' General Discussion tfx , help_request	3	3295	September 8, 2023
TFX Pipeline on Azure General Discussion tfx , azure , learning , help_request	8	3530	June 20, 2023
TFX Issue: RuntimeError: Failed to apply: CreateSavedModel[tf_v2_only] TFX-Addons models , tfx	4	978	August 16, 2023

Training TFDF with TFX

Related topics