One approach is to include only the most recent messages in each request, so you avoid hitting token limits.
To illustrate this, below is a reference sketch that trims the prompt before calling the GPT model so the request does not exceed the token limit.
The example estimates tokens roughly by word count (`len(prompt.split())`). If the prompt plus the expected response would exceed the model's token limit (`token_limit`), the prompt is trimmed to fit within that limit. You can use this method with the GPT-3.5 / GPT-4 chat API.
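Here is a minimal sketch of that trimming logic. The function names, the `token_limit=4096` value, and the `reserved_for_response` budget are illustrative assumptions, not exact figures for any specific model; a word count also underestimates real token usage, so treat it only as a rough guard.

```python
def estimate_tokens(text: str) -> int:
    """Rough token estimate: one token per whitespace-separated word (assumption, not exact)."""
    return len(text.split())


def trim_prompt(prompt: str, token_limit: int, reserved_for_response: int) -> str:
    """Drop the oldest words until the prompt plus the expected response fits the limit."""
    budget = token_limit - reserved_for_response
    words = prompt.split()
    if len(words) > budget:
        # Keep the most recent words so the latest context survives.
        words = words[-budget:]
    return " ".join(words)


# Example usage (token_limit=4096 and reserved_for_response=500 are assumptions; adjust per model):
prompt = "some very long conversation history ..."
trimmed = trim_prompt(prompt, token_limit=4096, reserved_for_response=500)
# `trimmed` would then be sent as the message content in your chat completion request.
```

For a more accurate count than word splitting, a tokenizer such as the `tiktoken` library can be used instead, but the trimming idea stays the same.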
In this way, you avoid token overflow and keep both the prompt and the response within the model's token limit.