I built a text summarization model by fine-tuning a T5 model, and while running inference I ran into a question.
The following two methods were given the same input text and the same min_target_length and max_target_length, but they yield summaries of different lengths.
Method 1 respects my parameter settings: if I set min_target_length to 64 and max_target_length to 256 (the same settings I used for training), it generates a summary about 180 tokens long.
Method 2, however, always generates a one-sentence summary, no matter how I set the parameters.
Does anyone know why this is happening? Thanks.
Method 1
from transformers import pipeline
summarizer = pipeline("summarization", model=model, tokenizer=tokenizer, framework="tf")
summarizer(
    raw_datasets["train"][0]["document"],
    min_length=MIN_TARGET_LENGTH,
    max_length=MAX_TARGET_LENGTH,
)
Method 2
import nltk  # needed for sent_tokenize below

# text: the raw document to summarize (defined elsewhere)
inputs = ["summarize: " + text]
inputs = tokenizer(inputs, max_length=MAX_INPUT_LENGTH, truncation=True, return_tensors="tf")
output = model.generate(**inputs, num_beams=8, do_sample=True, min_length=64, max_length=256)
decoded_output = tokenizer.batch_decode(output, skip_special_tokens=True)[0]
# keep only the first sentence of the decoded summary
predicted_summary = nltk.sent_tokenize(decoded_output.strip())[0]
print(predicted_summary)