Don’t announce rules that you cannot enforce
We would love to be able to tell our students exactly what they can and cannot do regarding using AI, and we know that our students also really want clear guidelines so they do not accidentally cheat. But, as Corbin et al. (2025) write, “The more detailed and specific our instructions become about ‘acceptable’ AI use, the more we highlight the gap between what we can specify and what we can verify”. When we try to code everything a student could do in relation to learning into the traffic light of red – forbidden, green – ok, and yellow – it depends, we run into several problems:
- The underlying assumption — that students will understand what exactly is permitted and what is not — is not valid. The lines between, for example, drafting and editing, are blurry: what if a suggested edit brings out an argument that we hadn’t thought about explicitly before but now we think we should stress it more? Where is the boundary between having a question explained and getting hints for how to approach it? What counts as “refining” an argument: correcting grammar but not alterning meaning, finding errors but not helping towards a solution?
- Students need to voluntarily follow the rules, even when there are disagreements (or misunderstanding, as above) between teachers and students about what is legitimate use of GenAI for learning or in assessment.
- Compliance cannot be enforced because there is no way to reliably detect GenAI use (other than leaving in parts of a response like “Sure, here is the response to the question …”, the teacher directly observing a student using GenAI, or the student admitting to it).
So while the traffic light analogy seemed appealing at first for how easily it signals to students what’s ok and what is not, traffic lights in real traffic are substantially different. They exist in much larger structures with speed cameras, police patrols, fines, etc, so rule violations can actually be detected and compliance enforced.
To deal with the challenge, Corbin et al. (2025) suggest a “two-lane approach”, where lane 1 is controlled, in-person assessment, and lane 2 is open in the sense that there is no attempt to control how students use GenAI. But lane 2 can of course be structured in a way to discourage misuse, like having someone sign off on data produced in a lab so that students have to continue with that data set rather than generating synthetic data; or checkpoints where students discuss their work and then asynchronously build on the ideas discussed with the instructor.
Corbin et al. (2025) conclude that “The path forward through this increasingly challenging terrain [of increasing AI capabilities] lies not in more sophisticated rules about AI use, but in fundamentally redesigning how we structure assessments to demonstrate student capability. This will require significant effort and creativity from educators but has the advantage of allowing for genuine solutions to maintaining assessment validity in an AI-enabled world. These must be solutions based not on what we say, but on what we do.”
So – how do we design cheat-proof assessment? There are of course attempts like going back to pen-and-paper (but can we detect AI glasses? And is writing by hand under time pressure really an inclusive and accessible way to assess learning?) or oral assessments (what about test anxiety or teacher biases?). But it is a slippery slope where we might slide from teaching into policing.
Read more:
- Corbin, T., Dawson, P., & Liu, D. (2025). Talk is cheap: why structural assessment changes are needed for a time of GenAI. Assessment & Evaluation in Higher Education, 50(7), 1087–1097. https://doi.org/10.1080/02602938.2025.2503964
An earlier version of this post was first published by Mirjam Glessmer on
https://mirjamglessmer.com/2026/04/13/currently-reading-corbin-et-al-2025-on-talk-is-cheap-why-structural-assessment-changes-are-needed-for-a-time-of-genai/ and is reposted here with permission as an example for possible content on this blog.
Mirjam Glessmer · 13 Jun 2026
