Open source

The tools we built, in the open.

The eval and agent infrastructure behind Studio, released for the teams doing the same work. MIT-licensed, used in production.

11
public repos
3.4k
total stars
28
contributors
MIT
licensed
Flagship
Pinned · most-used
whitescroll/evalkit

The measurement spine for AI-native engineering teams — harness, fixtures, CI reporters, and regression gates. The same eval discipline we install on every engagement, in a library.

$ npm i @whitescroll/evalkit
Stars★ 1,240
LanguageTypeScript
LicenseMIT
Used inevery Studio build
Updated2 days ago