Series Proving Agent Quality With Data

A series of experiments testing whether specialized AI agents on local models can match cloud API quality for personal task management.