eval-driven-dev

Category: Data & AI | Uploader: yiouliyiouli | Downloads: 0 | Version: v1.0 (latest)

Write and run evaluation tests, and iterate on failures.

Changelog: Source: GitHub https://github.com/yiouli/pixie-qa

Directory structure

Current level: tree/main/skills/eval-driven-dev/

  • 📁 references/
    • 📁 run-harness-examples/
      • 📄 cli-app.md 1.9 KB
      • 📄 fastapi-web-server.md 10.6 KB
      • 📄 standalone-function.md 1.3 KB
    • 📄 1-a-entry-point.md 2.1 KB
    • 📄 1-b-data-flow.md 9.8 KB
    • 📄 1-c-eval-criteria.md 3.8 KB
    • 📄 2-instrument-and-observe.md 8.3 KB
    • 📄 3-run-harness.md 6.3 KB
    • 📄 4-define-evaluators.md 6.3 KB
    • 📄 5-build-dataset.md 13.4 KB
    • 📄 6-run-tests.md 3.5 KB
    • 📄 7-investigation.md 6.6 KB
    • 📄 evaluators.md 15.5 KB
    • 📄 instrumentation-api.md 3.9 KB
    • 📄 testing-api.md 11.3 KB
  • 📁 resources/
    • 📄 run-with-timeout.sh 1.4 KB
    • 📄 setup.sh 1.3 KB
    • 📄 stop-server.sh 1000 B
  • 📄 SKILL.md 12.3 KB

SKILL.md
