Cracking the code: This group of U of T computer science researchers are decoding ciphers with AI

Codebreakers: Sheldon Huang, Ivan Zhang, Aidan Gomez, Muhammad Osama and Bryan Li, the FOR.ai research team (photo courtesy of FOR.ai).

Published: October 26, 2017

By Nina Haikara

To break the Enigma code during the Second World War, British computer scientist Alan Turing developed a mathematical model to unlock the cipher faster than any human.

Today, a group of �ؿ�ζSM undergraduate computer science students are decoding encrypted text using a neural network, a framework for machine learning algorithms inspired by the brain.

“We're at a stage [in our research] where we can pretty confidently say that the architecture works, and it's more general than anything that's been previously developed,” says Aidan Gomez, a fourth-year student in the department of computer science. Their accuracy results are above 95 per cent, he adds.

Gomez and the research team Sheldon Huang, Bryan Li, Muhammad Osama and Ivan Zhang, are 2018 fellows of AI Grant, a recently established non-profit that provides select projects nearly $50,000 in cloud computing resources from Google, among others. They also receive exclusive access to a global AI network of mentors including Andrej Karpathy, a U of T alumnus who was formerly of OpenAI and now director of AI at Tesla.

Roger Grosse, an assistant professor in the department of computer science, and Lukasz Kaiser, senior research scientist at Google Brain, help mentor the team.

Similar to natural language translation tasks, their project uses plain text, or English, and cipher text as two different languages. The neural network reads both and makes connections between the two without any additional support in translation.

Gomez says the method is able to crack a much more complicated cipher called Vigenère, historically termed the indecipherable cipher, where a hidden key is only known to the sender and recipient. The key determines an entirely different Caesarian, or shift cipher, to be used at each position; meaning the neural network can no longer simply count the frequencies of letters and perform simple frequency analysis.

Topics

Our Community

�ؿ�ζSM