Performance Benchmarks 2025
Report type: Platform latency benchmarks
Date: 2025-10-01
Summary
- Median response processing time measured under 2 seconds across standard workloads.
- Benchmarks include mixed retrieval and generation workloads.
Key Data
- Median response processing time under 2 seconds across standard workloads.
- Benchmarks include mixed retrieval and generation workloads.
Sources
- Real-Time API Performance Benchmarks 2025
- Chat Data performance benchmark harness.