Your LLM call isn't slow. One of the four stages is.
You shipped RAG two weeks ago. The demo was 800ms, production is 4.2 seconds at p95, and the only thing your trace tells you is "LLM call: 3.8s". So you start guessing. Maybe the model is slow today.