AI glossary

Streaming

Returning the model's response token-by-token as it's generated, instead of waiting for the full reply. Critical for chat UX — feels responsive even when the full response takes seconds.

Want to talk about how this applies to your stack?

Book a 20-min call →Browse all terms

More terms

Agent
Agentic workflow
BAA (Business Associate Agreement)
Cache (prompt caching)
Citations / grounding
Context window

4.3
99% Job Success
Top rated Plus
4.5
4.9
5.0
4.0