homehome Home chatchat Notifications


Facebook's new AI can translate from one programming language to another

This could save billions of dollars and years of work.

Mihai Andrei
June 18, 2020 @ 2:35 pm

share Share

Translating from one language to another is not just a nuisance. The Commonwealth Bank of Australia spent 5 years and a whopping $750 million to convert the code on their platform from COBOL (a programming language first developed over 50 years ago) to Java.

This is hardly an exception. Worldwide, 95% of ATM swipes and 80% of in-person transactions use COBOL, and much scientific and industrial research is still done in Fortran, another legacy programming language.

A surprisingly large part of the data in our day to day lives is based on old programming languages, and translating them to newer and more efficient languages is extremely expensive and time-consuming.

This is where Facebook’s AI comes in.

Example. Credits: Lachaux et al.

According to a new paper, researchers have developed a transcompiler — a system that converts source code from a high-level programming language (such as C++ or Python) to another. When done by humans, this migration is challenging and time-consuming as it requires extensive knowledge in both languages. But if you could train an algorithm to do that for you, you’d save a lot of time and resources.

Facebook researchers trained the new Artificial Intelligence (AI) on open source GitHub projects. The AI is unsupervised, so it learns on its own, mostly by looking for patterns in datasets. It requires a minimal amount of human supervision and expertise.

The TransCoder (as the AI was called) was trained on 2.8 million open source repositories, targeting translation at the function level. The AI started by looking for common keywords like “for,” “while,” “if,” and “try” and also digits, mathematical operators, and common English words or strings that appear in the source code. After the initial training period, the algorithm also undergoes a process of denoising and back-translating components (which was only done on functions).

This is not the first time something like this was attempted — several translation algorithms have already been developed, but Facebook engineers say this AI outperforms them by a ‘significant’ margin.

The results weren’t perfect. The TransCoder was tested on 852 parallel functions in all the 3 languages, exhibiting remarkable (but not perfect) accuracy. When translating from C++ to Java, 74.8% of the functions returned the expected output. From C++ to Python, the figure was 67.2%. The highest accuracy was obtained when translating from Java to C++ (91.6%), and the lowest was obtained from Python to Java (56.1%).

So it’s still not exactly perfect, but the approach has promise — and to make things even better, it can be easily adapted to a number of different programming languages with ease.

“TransCoder can easily be generalized to any programming language, does not require any expert knowledge, and outperforms commercial solutions by a large margin,” the coauthors wrote. “Our results suggest that a lot of mistakes made by the model could easily be fixed.”

While the algorithm hasn’t yet been adapted to languages such as COBOL, it’s only one step away. A quick and cheap revolution could finally be coming to out ATMs.

share Share

This 5,500-year-old Kish tablet is the oldest written document

Beer, goats, and grains: here's what the oldest document reveals.

A Huge, Lazy Black Hole Is Redefining the Early Universe

Astronomers using the James Webb Space Telescope have discovered a massive, dormant black hole from just 800 million years after the Big Bang.

Did Columbus Bring Syphilis to Europe? Ancient DNA Suggests So

A new study pinpoints the origin of the STD to South America.

The Magnetic North Pole Has Shifted Again. Here’s Why It Matters

The magnetic North pole is now closer to Siberia than it is to Canada, and scientists aren't sure why.

For better or worse, machine learning is shaping biology research

Machine learning tools can increase the pace of biology research and open the door to new research questions, but the benefits don’t come without risks.

This Babylonian Student's 4,000-Year-Old Math Blunder Is Still Relatable Today

More than memorializing a math mistake, stone tablets show just how advanced the Babylonians were in their time.

Sixty Years Ago, We Nearly Wiped Out Bed Bugs. Then, They Started Changing

Driven to the brink of extinction, bed bugs adapted—and now pesticides are almost useless against them.

LG’s $60,000 Transparent TV Is So Luxe It’s Practically Invisible

This TV screen vanishes at the push of a button.

Couple Finds Giant Teeth in Backyard Belonging to 13,000-year-old Mastodon

A New York couple stumble upon an ancient mastodon fossil beneath their lawn.

Worms and Dogs Thrive in Chernobyl’s Radioactive Zone — and Scientists are Intrigued

In the Chernobyl Exclusion Zone, worms show no genetic damage despite living in highly radioactive soil, and free-ranging dogs persist despite contamination.