The CHEAT Benchmark

For those interested in issues around agentic AI and assessment, I’m excited to announce the launch of the CHEAT Benchmark (https://cheatbenchmark.org/). The CHEAT Benchmark is an AI benchmark like SWE-Bench Pro or GPQA Diamond, except this benchmark measures an agentic AI’s willingness to help students cheat. By measuring and publicizing the degree of dishonesty of … Read more

Information Age vs Generation Age Technologies for Learning

It is absolutely critical that everyone who cares about technology-mediated learning understand this point. There is a seismic shift in perspective necessary from pre-generative AI technologies to generative AI technologies. It requires changes in the way we think about everything – from pedagogy to supporting infrastructure. I’ve been writing and speaking about this for months … Read more