Back to ToolsView full documentation and source
promptfoo
Test and evaluate prompts, agents, and RAG pipelines like you test code. Side-by-side evals, CI integration, and red-teaming for LLM apps.
Testing & Evals
TypeScriptCLIEvals
Repository
promptfoo/promptfooLIVE FROM GITHUB
★22,435stars
🍴2,000forks
Updated today
Installation
npx promptfoo@latest initHow to Use
promptfoo is a production-grade tool used by many agent builders. Start with the official installation command above, then follow the official documentation for configuration and advanced usage.
Most tools in this directory work great alongside agent skills and MCP servers across Claude Code, Codex, Grok Build, Gemini CLI, and Cursor.
Quick Links
GitHub Repository
Install with the command shown on the left.
Related Tools
Claude Code
Anthropic's agentic coding CLI. Reads your repo, edits files, runs commands, and ships features end-to-end from the terminal. Supports skills, MCP servers, hooks, and subagents.
OpenAI Codex CLI
OpenAI's open-source coding agent for the terminal. Plans, edits, and runs code in a sandbox with configurable approval modes. Reads AGENTS.md natively.
Grok Build
xAI's terminal coding agent. Plan Mode writes a file-by-file plan before editing, runs up to 8 parallel subagents in isolated git worktrees, and has native MCP support. Reads AGENTS.md and runs headless in CI. Powered by grok-build-0.1.