Multiple output values - predictions nowhere near what I was expecting

Huw_Thomas · June 5, 2023, 2:49am

Hi,

I’m trying to predict subjectivity and polarity for some sentences using the code below. I want to use the Universal Sentence Encoder multilingual model to produce the sentence vectors, so that I can explore training on one language and applying the model to other languages.

I know it’s a very small data set for the example, but I was expecting that running model.predict on a sentence it was trained on would give the correct results (or very similar). The last couple of lines below show what I was expecting and what I’m getting.

Does anyone know what I’m doing wrong?

Thanks,
Huw

import tensorflow as tf
import tensorflow_hub as hub
import numpy as np

# Load the universal sentence encoder multilingual
module = hub.load('Google | universal-sentence-encoder | Kaggle')

def get_sentences_vectors(sentences, batch: int = 1000):
    lower = 0
    upper = batch
    # Encode the first batch (call argument assumed to be the current slice of sentences)
    sent_vectors = module.signatures['question_encoder'](tf.constant(sentences[lower:upper]))["outputs"]
    while upper < len(sentences):
        lower += batch
        upper += batch
        print(lower)
        if sentences:
            # Encode the next batch and append it to the vectors collected so far
            sent_vectors = np.concatenate(
                (sent_vectors, module.signatures['question_encoder'](tf.constant(sentences[lower:upper]))["outputs"]))
    return sent_vectors

sentences = [
    "I really like this coat",
    "I really hate this coat",
    "I really like this jacket",
    "I really hate this jacket",
    "I really like this cat",
    "I really hate this cat",
    "I really like this dog",
    "I really hate this dog",
    "I really like this spaniel",
    "I really hate this spaniel",
    "I really like this terrier",
    "I really hate this terrier",
    "I really like this chair",
    "I really hate this chair",
    "I really like this beer",
    "I really hate this beer",
    "I really like this food",
    "I really hate this food",
    "I really like this chicken",
    "I really hate this chicken",
]

subjectivity_scores = [0.2, 0.9, 0.2, 0.9, 0.2, 0.9, 0.2, 0.9, 0.2, 0.9,
                       0.2, 0.9, 0.2, 0.9, 0.2, 0.9, 0.2, 0.9, 0.2, 0.9]
polarity_scores = [0.2, -0.8, 0.2, -0.8, 0.2, -0.8, 0.2, -0.8, 0.2, -0.8,
                   0.2, -0.8, 0.2, -0.8, 0.2, -0.8, 0.2, -0.8, -0.2, -0.8]

sentences_vectors = get_sentences_vectors(sentences)
sentence_vector_size = 512

# Split the dataset into training, validation, and test sets

train_size = int(0.8 * len(sentences))
val_size = int(0.1 * len(sentences))
test_size = len(sentences) - train_size - val_size

train_sentences = np.asarray(sentences_vectors[:train_size])
train_subjectivity = np.asarray(subjectivity_scores[:train_size])
train_polarity = np.asarray(polarity_scores[:train_size])

val_sentences = np.asarray(sentences_vectors[train_size:train_size + val_size])
val_subjectivity = np.asarray(subjectivity_scores[train_size:train_size + val_size])
val_polarity = np.asarray(polarity_scores[train_size:train_size + val_size])

test_sentences = np.asarray(sentences_vectors[train_size + val_size:])
test_subjectivity = np.asarray(subjectivity_scores[train_size + val_size:])
test_polarity = np.asarray(polarity_scores[train_size + val_size:])

# Define the model architecture

model = tf.keras.Sequential([
    tf.keras.layers.Dense(64, activation='relu', input_shape=(sentence_vector_size,)),
    tf.keras.layers.Dense(32, activation='relu'),
    tf.keras.layers.Dense(2, activation='linear')  # 2 outputs: subjectivity and polarity
])

# Compile the model

model.compile(optimizer='adam', loss='mse')

# Train the model

model.fit(train_sentences, [train_subjectivity, train_polarity], epochs=10, batch_size=32,
          validation_data=(val_sentences, [val_subjectivity, val_polarity]))

# Evaluate the model (this call uses the training set, not the test set)

loss = model.evaluate(train_sentences, [train_subjectivity, train_polarity])

# Extract subjectivity and polarity predictions

vectors = get_sentences_vectors(["I really hate this coat"]).numpy()
predictions = model.predict(vectors)
subjectivity_predictions = predictions[0][0]
polarity_predictions = predictions[0][1]
print(f"subjectivity_predictions : {subjectivity_predictions}")  # Thought this would return (0.9) - gives 0.31
print(f"polarity_predictions : {polarity_predictions}")  # Thought this would return (-0.8) - gives 0.35
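
One thing that may be worth checking, offered as a sketch and not a confirmed diagnosis: the model has a single output of width 2, but `fit` receives the targets as a list of two separate arrays. Stacking them into one array of shape `(N, 2)` makes column 0 line up with `predictions[0][0]` and column 1 with `predictions[0][1]`. The name `train_targets` is introduced here for illustration only, and the short arrays are stand-ins for the full score lists.

```python
import numpy as np

# Stand-in values; in the script these would be train_subjectivity and train_polarity.
train_subjectivity = np.asarray([0.2, 0.9, 0.2, 0.9])
train_polarity = np.asarray([0.2, -0.8, 0.2, -0.8])

# One row per sentence, one column per unit of the final Dense(2) layer.
train_targets = np.column_stack([train_subjectivity, train_polarity])

print(train_targets.shape)  # (4, 2)
print(train_targets[1])     # subjectivity and polarity of the second sentence
```

The stacked array would then be passed as the single `y` argument, for example `model.fit(train_sentences, train_targets, ...)`, with the validation targets stacked the same way. Separately, with 16 training rows and `batch_size=32`, each epoch is a single gradient step, so `epochs=10` gives only 10 weight updates, which is probably too few to fit even the training sentences.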

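The batching in `get_sentences_vectors` could also be written so that it always returns a NumPy array, however many batches run. This is a minimal sketch under stated assumptions: `encode_in_batches` and `fake_encoder` are hypothetical names, and `fake_encoder` is a stand-in for the real encoder call so that the example runs without loading the model.

```python
import numpy as np

def encode_in_batches(sentences, encode_fn, batch=1000):
    """Encode sentences in slices of `batch` and return one 2-D NumPy array.

    Assumes `sentences` is non-empty and that `encode_fn` returns an
    array-like of shape (len(slice), dim) for each slice.
    """
    chunks = []
    for lower in range(0, len(sentences), batch):
        chunks.append(np.asarray(encode_fn(sentences[lower:lower + batch])))
    return np.concatenate(chunks, axis=0)

def fake_encoder(batch_of_sentences):
    # Hypothetical stand-in: a 3-dimensional vector per sentence.
    return [[float(len(s)), 0.0, 1.0] for s in batch_of_sentences]

vectors = encode_in_batches(["a", "bb", "ccc"], fake_encoder, batch=2)
print(vectors.shape)  # (3, 3)
```

Because the result is already a NumPy array, it can go straight into `model.predict` without a trailing `.numpy()` call.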