Issue Description
Adds an evaluation dataset creator skill that guides users through a structured interview process to generate comprehensive test datasets for AI agents. The implementation should be based best practices used with customers.
Additional Context
No response