pinchbench

Category: Tools & Productivity | Uploader: pinchbench | Downloads: 0 | Version: v1.0 (Latest)

Run PinchBench benchmarks to evaluate OpenClaw agent performance across real-world tasks. Use when testing model capabilities, comparing models, submitting benchmark results to the leaderboard, or checking how well your OpenClaw setup handles calendar, email, research, coding, and multi-step workflows.

Changelog: Source: GitHub https://github.com/pinchbench/skill

Directory Structure

Current level: tree/main/

  • 📁 .github/
    • 📁 workflows/
      • 📄 lint.yml 571 B
    • 📄 benchmark-models.yml 1.7 KB
  • 📁 assets/
    • 📄 ai_blog.txt 4.5 KB
    • 📄 company_expenses.xlsx 5.9 KB
    • 📄 GPT4.pdf 7.0 MB
    • 📄 OpenClaw Agent Use Cases and Gap Analysis for PinchBench.pdf 74.1 KB
    • 📄 quarterly_sales.csv 1.3 KB
  • 📁 scripts/
    • 📄 benchmark.py 25.8 KB
    • 📄 lib_agent.py 35.8 KB
    • 📄 lib_grading.py 19.2 KB
    • 📄 lib_tasks.py 6.7 KB
    • 📄 lib_upload.py 14.0 KB
    • 📄 lint_argparse_help.py 2.3 KB
    • 📄 run.sh 250 B
  • 📁 tasks/
    • 📄 task_00_sanity.md 1.5 KB
    • 📄 task_01_calendar.md 3.7 KB
    • 📄 task_02_stock.md 3.6 KB
    • 📄 task_03_blog.md 3.9 KB
    • 📄 task_04_weather.md 4.4 KB
    • 📄 task_05_summary.md 8.5 KB
    • 📄 task_06_events.md 4.0 KB
    • 📄 task_07_email.md 3.9 KB
    • 📄 task_08_memory.md 5.4 KB
    • 📄 task_09_files.md 3.9 KB
    • 📄 task_10_workflow.md 7.8 KB
    • 📄 task_11_clawdhub.md 3.2 KB
    • 📄 task_12_skill_search.md 4.3 KB
    • 📄 task_13_image_gen.md 6.1 KB
    • 📄 task_14_humanizer.md 4.0 KB
    • 📄 task_15_daily_summary.md 13.2 KB
    • 📄 task_16_email_triage.md 25.8 KB
    • 📄 task_17_email_search.md 30.3 KB
    • 📄 task_18_market_research.md 11.1 KB
    • 📄 task_19_spreadsheet_summary.md 10.1 KB
    • 📄 task_20_eli5_pdf_summary.md 6.1 KB
    • 📄 task_21_openclaw_comprehension.md 5.4 KB
    • 📄 task_22_second_brain.md 8.1 KB
    • 📄 task_24_polymarket_briefing.md 6.6 KB
    • 📄 TASK_TEMPLATE.md 8.6 KB
  • 📁 tests/
    • 📄 test_lib_grading.py 1.7 KB
  • 📄 .gitignore 616 B
  • 📄 .pre-commit-config.yaml 318 B
  • 📄 crab.txt 3.4 KB
  • 📄 Dockerfile.benchmark 923 B
  • 📄 LICENSE 1.0 KB
  • 📄 pinchbench.png 592.5 KB
  • 📄 pyproject.toml 678 B
  • 📄 README.md 5.9 KB
  • 📄 SKILL.md 4.2 KB
