2. How it works (Standard)

The user interacts with the demo app (travel advisor) on port 30100. The app is monitored either via native OpenTelemetry.

The user enters a destination (eg. Sydney):

  • The application first checks the cache.
    • If a response for Sydney is found, the response is returned from the cache.
    • If a cached response is not available, the application requests advice from the LLM (OpenAI's ChatGPT).
  • The response is returned and cached so that subsequent calls for the same destination (eg. Sydney) are served from the cache. This saves roundtrips to ChatGPT and thus $.