Skip to content
AIAn Alian Software company

AI glossary

Streaming

Returning the model's response token-by-token as it's generated, instead of waiting for the full reply. Critical for chat UX — feels responsive even when the full response takes seconds.

Want to talk about how this applies to your stack?