braintrustdata/autoevals
AutoEvals is a tool for quickly and easily evaluating AI model outputs using best practices.
Evaluate your LLM-powered apps with TypeScript.
Parametric CAD modeling with JavaScript
Local-first, open-source alternative to Anthropic's Claude Design.
⌘K is a command menu React component that can also be used as an accessible combobox. You render items, it filters and sorts them automatically. ⌘K supports a fully composable API, so you can wrap items in other components or even as static JSX.
PgQue – Zero-bloat Postgres queue. One SQL file to install, pg_cron to tick.
CLI proxy that reduces LLM token consumption by 60-90% on common dev commands. Single Rust binary, zero dependencies
Stagehand is a browser automation framework used to control web browsers with natural language and code. By combining the power of AI with the precision of code, Stagehand makes web automation flexible, maintainable, and actually reliable.
The free AI already on your Mac. CLI tool, OpenAI-compatible server, and interactive chat — all on-device via Apple Intelligence.
Auto-pilot your Claude Code. Add a local LLM brain that learns from your actions and auto-approves/denies
The Zero-Server Code Intelligence Engine - GitNexus is a client-side knowledge graph creator that runs entirely in your browser. Drop in a GitHub repo or ZIP file, and get an interactive knowledge graph with a built-in Graph RAG Agent. Perfect for code exploration.
Write HTML. Render video. Built for agents.
An embedded graph database built for query speed and scalability. Ladybug is optimized for handling complex analytical workloads on very large databases and provides a set of retrieval features, such as full-text search and vector indices.
A JavaScript library aimed at visualizing graphs of thousands of nodes and edges.
A high-performance, eBPF-based network traffic analyzer written in Rust.