Best Practices

To get the most out of the Serika.dev API, follow these best practices.

1. Handle Errors Gracefully

Always wrap your API calls in try-catch blocks to handle potential errors like rate limits or timeouts.

from openai import APIError

try:
    response = client.chat.completions.create(...)
except APIError as e:
    print(f"OpenAI API returned an API Error: {e}")

2. Use Streaming for Long Responses

For chat applications, streaming provides a much better user experience by showing the response as it’s being generated.

3. Manage Context Window

Be mindful of the token limit. If a conversation gets too long, you may need to summarize previous messages or remove older ones to fit within the model’s context window.

4. Secure Your API Keys

Never expose your API keys in client-side code (browsers, mobile apps). Always route requests through your own backend server.

5. Use the Jobs API for Heavy Tasks

For complex image generation tasks, especially with TensorArt models, use the Jobs API. This prevents HTTP timeouts and ensures reliable delivery of results.