Hi Joao_Lazzaro,
With the release of new models and old models being deprecated, we have seen a huge bump of daily active users which could be one of the reasons for higher processing times..
Can you please provide some of the long context prompts that was used to test the model along with processing times.. It helps us to compare the results from our end and can successfully rule-out other performance reasons.