I decided that it must be possible to include all the
prompts for a routine sequence of prompts in one rather long prompt.
I wanted to get:
- A student transcript with the errors marked so they could try to see their errors
- A corrected version of the transcript
- An improved version at the students level (A2, in this case)
- An improved version at the next level (B1, in this case)
- An improved version at two levels up (B2, in this case)
The secret was to include the five steps in the prompt but
to instruct GenAI to wait for the prompt “Next” before moving on to the next
step.
I then took a short recording made by a very good A2 student
and used Rev.com to get the transcript. The idea is that the student has to copy
the long prompt from a WhatsApp group, for example, and paste it into the
chosen GenAI and then copy the transcript from Turboscribe.ai or Rev.com and paste it into the same GenAI.
I did this with these 5 types of GenAI:
- ChatGPT
- Gemini
- Claude
- Copilot
- Deepseek
I then copied the output from each GenAI for each step and
pasted it into Pearson’s GSE Text Analyzer ( https://www.english.com/gse/teacher-toolkit/user/textanalyzer
), which gave me GSE and CEFR levels for all 25 versions of the transcript.
Apart from looking carefully at the English used in each
version to try to understand what language had determined the levels given for
them all, I also made an Excel spreadsheet with the two sets of levels. With
these I was able to produce two graphs. The first one shows what happened with
each type of GenAI:
At first glance Claude was the best at producing increasingly
sophisticated versions of the student’s transcript, although they were always
half a CEFR level too high. Mark you, the original transcript was already half
a level higher as she was a very good student.
Equally obvious is the fact that Copilot was useless!
ChatGPT, Gemini and Deepseek failed to produce increasingly sophisticated
versions across the four levels, so they didn’t do what I had intended.
In fact, as can be seen in this second chart, the whole idea
didn’t work on average:
Once again, this may be as a result of the student’s
original transcript being higher than expected (A2+ rather than A2). Maybe the
lesson to be learnt from this is that instead of using fixed levels for each
step up, the different GenAIs may be able to produce new version of the
transcript one level higher on the CEFR scale. This would have the added advantage
that the same long prompt could be useful for all students in a class and
across all courses.
Here is the original version of the long prompt derived with
help from Turboscribe and ChatGPT’s advice. (I hope the concept of including
steps and a trigger word will be useful):
“You will be provided a transcript as well as instructions
about what to do with that transcript. Unless otherwise specified, your
response should be in the same language as the transcript.[1]
Here are your instructions, which you must follow:
Instructions:
- I will provide a transcript. For each step, you should follow the instructions carefully.
- For each task, do not include timestamps unless specifically requested.
- After each step, wait for me to say "next" before proceeding.
Steps:
- Mark the errors in the transcript in bold, but do not correct them.
- Correct the errors, marking the changes in bold and leaving the original errors in brackets.
- Improve the transcript for A2 level students, marking improvements in bold.
- Improve the transcript to a B1 level, marking changes in bold.
- Finally, enhance the transcript to a B2 level, marking improvements in bold.
[Sample transcript of an A2 student’s recording:]
The history start in the summer of 2011. Anna went on
holiday with some friends on island. The photo was taken on hill. Hill is a
little mountain and called Ana. The photo is important for her because the
stone is a mysterious for her. And she put your hands around it, the stone,
and, and she was sleeping. And this photo there are in other places because in
your mobile phone, computer, et cetera.”
[1]
These two sentences came from Turboscribe’s Custom prompt