With each prompt I get a response as well as METADATA information.
At the prompt I can count the characters and see how many tokens were needed.
So I find some relation which I notice is about x6 times different from the relation of received tokens and characters
Usually about 4 characters spend one token on average, but for me, since they are in Cyrillic, the ratio is x2 more. However, as far as I can see this is only true for sent tokens. But that doesn’t matter because I’m comparing the ratio of sent/receive tokens. That way, there shouldn’t be much of a difference.
Prompt Ratio / Generated Ration =
The tokens are taken from the metadata, and I count the symbols in the program. So above formula is stable. Sometimes we get more information that I have not included as citations, but there the information is still insignificant
I don’t know where this large expenditure of received tokens comes from.
Also, the generated tokens are much more expensive than the prompt ones.
I continue to argue that there is an unfair accounting of expensive tokens to generate.
I would appreciate it if someone could do the same test.