Topics tagged transformers

Topic	Replies	Views	Activity
Gemma 4 27B-A4B-it (MoE) on Vertex AI: vLLM Dependency Triangles, LoRA, and Vision Blockers Gemma models , gemma , vertexai , pytorch , transformers	1	268	April 27, 2026
Tf.keras.Model and model class adaptation General Discussion datasets , keras-layer , tfkeras , tfdata , transformers	1	326	February 23, 2026
OpenAI’s GPT-2 param run model General Discussion nlp , transformers	1	1295	September 25, 2025
Optimizing seq2seq decoding script Keras models , getting_started , education , help_request , transformers	1	539	July 25, 2025
Tensorflow ver 2.17.0 Gemini API transformers	1	74	July 21, 2025
Help! When building or training model get error: "ValueError: The first argument to `Layer.call` must always be passed. " General Discussion transformers	1	2761	May 23, 2025
Keras Transformer Implementation Error Keras models , transformers	1	90	January 16, 2025
From tensorflow.keras.wrappers.scikit_learn import KerasClassifier General Discussion scikeras , keras , transformers	6	13406	April 15, 2024
Fine-tuning GPT2 for text summary Keras keras_nlp , transformers	1	841	December 27, 2024
I have been training a decoder based transformer for word generation. But it keeps generating the same words over and over again Keras api , help_request , transformers	1	682	December 20, 2024
Create_padding_mask in Transformer code uses encoder input sequence for creating padding mask in 2nd attention block of the decoder General Discussion models , help_request , transformers	1	1299	December 12, 2024
Apply a traied model with tensorflow on transformer pipeline pop out error General Discussion models , keras , transformers	1	864	November 22, 2024
How to calculate BLUE score, precision, recall, calibration, confusion matrix for transformer? General Discussion transformers	1	366	October 11, 2024
GPT NEO-For COVID-19 Question Answering TensorFlow nlp , transformers	1	1608	October 10, 2024
Exception encountered when calling layer 'softmax' (type Softmax) Keras transformers	1	508	September 19, 2024
Masking propagation through layers Keras models , nlp , transformers	1	777	September 19, 2024
T5 fine-tuned model: one method ignores min_target_length parameter while one does not General Discussion models , transformers	1	384	August 23, 2024
Issue with Deserializing a Custom Transformer Model in TensorFlow Keras tfconfig , transformers	1	544	May 20, 2024
RESOURCE_EXHAUSTED when running TimeDistributed on MultiHeadAttention TensorFlow models , tfkeras , transformers	1	471	January 29, 2024
How do I use sentence-transformers/all-MiniLM-L6-v2 tflite model in android studio (kotlin) General Discussion tflite , transformers	1	1711	January 23, 2024
Getting very less accuracy in vision transformer Keras transformers	0	427	October 17, 2023
What is the model suitable for time series forecasting? General Discussion help_request , transformers	2	637	October 13, 2023
Call Tensorflow Model in a loop leaks memory General Discussion nlp , keras , transformers	1	1464	September 25, 2023
Issue with HuggingFace psuh_to_hub General Discussion nlp , keras , transformers	1	980	June 20, 2023
Though Training accuracy is high performance on training data during inference in transformer translation is poor General Discussion models , transformers	0	641	June 9, 2023
How Hugging Face improved Text Generation performance with XLA Show and Tell keras , xla , transformers	1	996	June 8, 2023
How to extract body of a transformer like models and fine tune with that body on different data TensorFlow models , transformers	2	534	June 5, 2023
Does TransformerEncoder layer accept built-in mask? General Discussion api , keras , transformers	1	805	May 8, 2023
Save and restore transformer model General Discussion models , keras , transformers	1	1147	March 18, 2023
Main transformers use-cases and insights General Discussion transformers	0	758	February 7, 2023