Regardless of where you sit on the “should you use an AI detector or not” debate, it’s important to understand how it works.
This might be surprising to some but how they work is very similar to how an instructor reads and analyzes a paper to see if it is plagiarized. After all, we’ve trained AI on human behaviour and human-generated data.
Every AI detector is a little different but generally speaking, they follow this structure:
AI detectors start first by reading the text, similar to what a human grader would do. They examine the vocabulary used, sentence structure, and any unique writing styles. Here are some of the ways that AI detectors can detect patterns:
AI detectors are also trained with datasets of human-written text and AI-generated text. The training data will help the detector-in-training understand the patterns that human writing exhibits and the ones that AI-generated text tends to exhibit such as the patterns we mentioned above.
For example, perhaps in the training data, AI-generated text contains more sentences of similar n-grams like the following:
"The geese are honking loudly in the early morning. The geese are flying together over the lake. The geese are landing gracefully on the water. The geese are foraging for food along the shore. The geese are resting under the trees."
Versus a more inconsistent pattern:
"The geese are honking loudly as they fly over the lake. Geese land gracefully on the water, searching for food. Under the trees, the geese rest, enjoying the shade and the cool breeze."
While this is an oversimplification of how AI detection works, you can already see how it is trained and where bias can creep in.
What if students are taught to write in a certain way?
What if students use vocabulary that is overly complex for that grade level?
What if AI-generated text now uses a variety of n-grams to form sentences?
Enhancing education with AI-powered grading and feedback.