The prediction must be made at the moment of interaction (product recommendations during a browsing session, fraud detection at the point of transaction) - The input data is unique to the request and not known in advance (a specific customer's current shopping cart) - Low latency is critical (sub-se