I am following this example to use BERT for sentiment classification.
import tensorflow as tf
import tensorflow_hub as hub
import tensorflow_text  # registers the custom ops used by the preprocessing model

text_input = tf.keras.layers.Input(shape=(), dtype=tf.string)
preprocessor = hub.KerasLayer(
    "https://tfhub.dev/tensorflow/bert_en_uncased_preprocess/3")
encoder_inputs = preprocessor(text_input)
encoder = hub.KerasLayer(
    "https://tfhub.dev/tensorflow/bert_en_uncased_L-12_H-768_A-12/4",
    trainable=True)
outputs = encoder(encoder_inputs)
pooled_output = outputs["pooled_output"] # [batch_size, 768].
sequence_output = outputs["sequence_output"] # [batch_size, seq_length, 768].
embedding_model = tf.keras.Model(text_input, pooled_output)
sentences = tf.constant(["(your text here)"])
print(embedding_model(sentences))
The default sequence length seems to be 128, judging from the output shapes of encoder_inputs. However, I'm not sure how to change this; ideally I'd like to use a larger sequence length.
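For context, this is how I checked the shapes (the dict keys below are what I recall the preprocessor returning; output abbreviated):

# Each tensor in the preprocessor's output dict has the packed sequence
# length as its last dimension (128 here).
for name, tensor in encoder_inputs.items():
    print(name, tensor.shape)
# input_word_ids (None, 128)
# input_mask (None, 128)
# input_type_ids (None, 128)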
There is an example of modifying the sequence length on the preprocessor page (roughly the snippet below), but I'm not sure how to incorporate it into the functional model definition I have above. I would greatly appreciate any help with this.
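As I understand that example, tokenize and bert_pack_inputs are called as two separate steps so that seq_length can be passed; the seq_length = 256 below is just my placeholder value:

import tensorflow as tf
import tensorflow_hub as hub
import tensorflow_text  # registers the custom ops used by the preprocessing model

preprocessor = hub.load(
    "https://tfhub.dev/tensorflow/bert_en_uncased_preprocess/3")
text_inputs = [tf.keras.layers.Input(shape=(), dtype=tf.string)]

# Step 1: tokenize each text segment.
tokenize = hub.KerasLayer(preprocessor.tokenize)
tokenized_inputs = [tokenize(segment) for segment in text_inputs]

# Step 2: pack the tokenized inputs, passing the desired sequence length.
seq_length = 256  # instead of the default 128
bert_pack_inputs = hub.KerasLayer(
    preprocessor.bert_pack_inputs,
    arguments=dict(seq_length=seq_length))
encoder_inputs = bert_pack_inputs(tokenized_inputs)

What I can't figure out is how to swap these two steps in for the single preprocessor = hub.KerasLayer(...) call in my model above while keeping the rest of the functional definition the same.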