重口味SM

Cracking the code: This group of U of T computer science researchers are decoding ciphers with AI

Photo of undergraduate codebreakers
Codebreakers: Sheldon Huang, Ivan Zhang, Aidan Gomez, Muhammad Osama and Bryan Li, the FOR.ai research team (photo courtesy of FOR.ai).

To break the Enigma code during the Second World War, British computer scientist Alan Turing developed a mathematical model to unlock the cipher faster than any human.

Today, a group of 重口味SM undergraduate computer science students are decoding encrypted text using a neural network, a framework for machine learning algorithms inspired by the brain.

鈥淲e're at a stage [in our research] where we can pretty confidently say that the architecture works, and it's more general than anything that's been previously developed,鈥 says Aidan Gomez, a fourth-year student in the department of computer science. Their accuracy results are above 95 per cent, he adds.

Gomez and the research team Sheldon Huang, Bryan Li, Muhammad Osama and Ivan Zhang, are 2018 fellows of AI Grant, a recently established non-profit that provides select projects nearly $50,000 in cloud computing resources from Google, among others. They also receive exclusive access to a global AI network of mentors including Andrej Karpathy, a U of T alumnus who was formerly of OpenAI and now director of AI at Tesla. 

Roger Grosse, an assistant professor in the department of computer science, and Lukasz Kaiser, senior research scientist at Google Brain, help mentor the team.

Similar to natural language translation tasks, their project uses plain text, or English, and cipher text as two different languages. The neural network reads both and makes connections between the two without any additional support in translation. 

Gomez says the method is able to crack a much more complicated cipher called Vigen猫re, historically termed the indecipherable cipher, where a hidden key is only known to the sender and recipient. The key determines an entirely different Caesarian, or shift cipher, to be used at each position; meaning the neural network can no longer simply count the frequencies of letters and perform simple frequency analysis. 

Read more about Aidan Gomez

鈥淭his is a much more complicated cipher to crack and it鈥檚 part of the goal of getting closer and closer to the complexity of unsupervised language translation itself,鈥 says Gomez.  

Huang, who is also president and co-founder of the FOR.ai partner organization, , or UTMIST, says their approach is fundamentally different from current approaches that are supervised with human feedback or labelled data 鈥 not unlike the task of translating the alien language seen in the movie Arrival.

鈥淭hey crack the language by making connections between two languages, word by word,鈥 says Huang. 

鈥淣one of [the algorithm] is hard-coded or relying on a human鈥檚 knowledge of language,鈥 says Gomez. 鈥淲e came up with an architecture than can infer those mappings independently.鈥 

The group says cracking modern ciphers is impractical, and provably assured to be too difficult. With an end-goal of unsupervised language translation, say English to German based on two completely unrelated texts, their cipher methods could be used to unlock lost languages, when native speakers no longer exist.  

鈥淭his projects clearly demonstrates a neural network鈥檚 capacity to build up a really strong model of language, and then apply that to drawing connections between two abstract languages,鈥 says Gomez. 

The FOR.ai team will be looking to recruit members interested in participating in their machine learning research. But Zhang forewarns it is gruelling 鈥 though intensely gratifying 鈥 work.

鈥淓ven at a very high-tech lab like [the department of computer science鈥檚 machine learning] group, these things still take days [to perform],鈥 says Zhang. 

鈥淎 lot of hardware, running experiments 鈥 and a lot of epiphanies.鈥

 

 

The Bulletin Brief logo

Subscribe to The Bulletin Brief

Computer Science