eval-driven-dev

Category: Data & AI | Uploader: yiouliyiouli | Downloads: 0 | Version: v1.0(Latest)

write and run eval tests, and iterate on failures.

Changelog: Source: GitHub https://github.com/yiouli/pixie-qa

Directory Structure

Current level: tree/main/skills/eval-driven-dev/

  • 📁 references/
    • 📁 run-harness-examples/
      • 📄 cli-app.md 1.9 KB
      • 📄 fastapi-web-server.md 10.6 KB
      • 📄 standalone-function.md 1.3 KB
    • 📄 1-a-entry-point.md 2.1 KB
    • 📄 1-b-data-flow.md 9.8 KB
    • 📄 1-c-eval-criteria.md 3.8 KB
    • 📄 2-instrument-and-observe.md 8.3 KB
    • 📄 3-run-harness.md 6.3 KB
    • 📄 4-define-evaluators.md 6.3 KB
    • 📄 5-build-dataset.md 13.4 KB
    • 📄 6-run-tests.md 3.5 KB
    • 📄 7-investigation.md 6.6 KB
    • 📄 evaluators.md 15.5 KB
    • 📄 instrumentation-api.md 3.9 KB
    • 📄 testing-api.md 11.3 KB
  • 📁 resources/
    • 📄 run-with-timeout.sh 1.4 KB
    • 📄 setup.sh 1.3 KB
    • 📄 stop-server.sh 1000 B
  • 📄 SKILL.md 12.3 KB

SKILL.md

Login to download/like/favorite ❤ 5 | ★ 0
Comments 0

Please login before commenting.

Loading comments...