This example Next.js project evaluates multiple LLM models using the Vercel AI SDK and streams updates to the frontend using Trigger.dev Realtime.
- Uses the batch.triggerByTaskAndWait method.
- The evaluateModels task uses the batch.triggerByTaskAndWait method to distribute the task to the different LLM models.
- The summarizeEvals task calculates some dummy “tags” for each LLM response.
- AnthropicEval component: src/components/evals/Anthropic.tsx
- XAIEval component: src/components/evals/XAI.tsx
- OpenAIEval component: src/components/evals/OpenAI.tsx
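
The flow between the evaluateModels and summarizeEvals tasks can be pictured as a fan-out followed by a fan-in. The sketch below is library-free and is not the project's actual code: it uses plain `Promise.all` as a stand-in for batch.triggerByTaskAndWait, and a stub function in place of real model calls through the Vercel AI SDK. The helper names (`evaluateWithModel`, `runOne`), the result shape, and the tag format are all assumptions made for illustration.

```typescript
// Library-free sketch of the fan-out / fan-in flow described above.
// Promise.all stands in for batch.triggerByTaskAndWait; all names and
// shapes here are hypothetical, not the project's real definitions.

type ModelName = "anthropic" | "xai" | "openai";

interface EvalResult {
  model: ModelName;
  ok: boolean;
  output?: string;
  error?: string;
}

interface TaggedEval extends EvalResult {
  tags: string[];
}

const MODELS: ModelName[] = ["anthropic", "xai", "openai"];

// Stub for a per-model child task. A real task would call the model here.
async function evaluateWithModel(model: ModelName, prompt: string): Promise<string> {
  if (prompt.trim() === "") {
    throw new Error("empty prompt");
  }
  return `[${model}] response to: ${prompt}`;
}

// Runs one child and converts a thrown error into a failed result,
// so one failing model does not reject the whole batch.
async function runOne(model: ModelName, prompt: string): Promise<EvalResult> {
  try {
    const output = await evaluateWithModel(model, prompt);
    return { model, ok: true, output };
  } catch (err) {
    const message = err instanceof Error ? err.message : String(err);
    return { model, ok: false, error: message };
  }
}

// Fan-out: start one child per model, then wait for all of them.
// Promise.all preserves the order of MODELS in the returned array.
async function evaluateModels(prompt: string): Promise<EvalResult[]> {
  return Promise.all(MODELS.map((model) => runOne(model, prompt)));
}

// Fan-in: attach some dummy tags to each response.
function summarizeEvals(results: EvalResult[]): TaggedEval[] {
  return results.map((result) => ({
    ...result,
    tags: result.ok
      ? ["ok", `chars:${(result.output ?? "").length}`]
      : ["failed"],
  }));
}
```

Calling `summarizeEvals(await evaluateModels("Say hello"))` yields one tagged entry per model, in the order of `MODELS`. In the real project each child would be a separate Trigger.dev task run rather than an in-process promise, which is what lets the frontend components subscribe to each run individually.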