IR-A00: Assignment 0 — Warmup
Overview
Familiarization with GitHub Classroom workflow. Implement a TREC run file parser.
Task: Implement load_run_file function in modules/dataset.py to parse TREC-format run files.
TREC run file format: Each line: queryID Q0 docID rank score runID
Key details:
- Individual assignment, PASS/FAIL (must pass 100% of tests)
- Python 3.11, uses pytest for testing
- Submit via git push to main branch
Takeaway
The TREC run file format is the standard way to represent ranked retrieval results in IR evaluation. Understanding this format is essential for all subsequent assignments.