How Neural Machine Translation Will Change E-Discovery

When a global enterprise gets sued, it’s vital to know who is involved and how. But finding out who to blame isn’t always simple.

Global law firms are tasked with sifting through thousands, sometimes millions of emails, chats, and legal documentation during eDiscovery. These documents and audio recordings could be in many different languages and stored around the world. Sometimes that data is stored in countries with strong data protection regulations, such as Brazil and parts of the EU, so it cannot under any circumstances leave the country.

So, how can an office in the U.S. review hundreds of days of correspondence in multiple languages?

If the firm hires translators, they’ll need dozens with a strong knowledge of everything from slang to deep subject matter expertise of the topic in discovery. If instead they decide to go with an e-discovery translation solution, they’ll still need help during the review process, especially for data in Asian languages – there are several ways to interpret one word, for which there may be five slang alternatives. In either case, the team must spend a lot of time and money to get reliable and accurate results.

Until now, that is.

Continue reading

SYSTRAN’s Continuing Neural MT Evolution

by Kirti Vashee on eMpTy Pages, a blog about translation technology, localization and collaboration

Recently, I had the opportunity and kind invitation to attend the SYSTRAN community day event where many members of their product development, marketing, and management team gathered with major customers and partners.

The objective was to share information about the continuing evolution of their new Pure Neural MT (PNMT) technology,  share detailed PNMT output quality evaluation results, and provide initial customer user experience data with the new technology. Also, naturally such an event creates a more active and intense dialogue between company employees and customers and partners.  This, I think has substantial value for a company that seeks to align product offerings with its customer’s actual needs.

Ongoing Enhancements of the PNMT Product Offering

The event made it clear that SYSTRAN is well down the NMT path, possibly years ahead of other MT vendors, and provided a review of the current status of their rapidly evolving PNMT technology.

Read more

We are SYSTRAN and we love languages…

Click here to read the french version

SYSTRAN Banner

We are SYSTRAN. We love languages, lots of languages. We are a human-sized company but we have linguists for almost all of the 140 language pairs we support.  That’s a big number, but don’t be misled- some of us are fluent in many languages. Nevertheless, we love languages and we don’t believe in the one-fits-all technology regarding language processing.

Continue reading

How does Neural Machine Translation work?

The representation of meaning in Neural, Rule-Based and Phrase-Based Machine Translation

In this issue of step-by-step articles, we explain how neural machine translation (NMT) works and compare it with existing technologies: rule-based engines (RBMT) and phrase-based engines (PBMT, the most popular being Statistical Machine Translation – SMT).

The results obtained from Neural Machine Translation are amazing, in particular, the neural network’s paraphrasing. It almost seems as if the neural network really “understands” the sentence to translate. In this first article, we are interested in “meaning,” that which gives an idea of the type of semantic knowledge the neural networks use to translate.

Let us start with a glimpse of how the 3 technologies work, the different steps of each translation process and the resources that each technology uses to translate. Then we will take a look at a few examples and compare what each technology must do to translate them correctly.

Continue reading

SYSTRAN is far along the NMT path

A Deep Dive into SYSTRAN’s Neural Machine Translation (NMT) Technology

by Kirti Vashee on eMpTy Pages, a blog about translation technology, localization and collaboration

Neural Machine Translation

[…] So, I recently had a conversation with Jean Senellart , Global CTO and SYSTRAN SAS Director General, to find out more about their new NMT technology. He was very forthcoming, and responded to all my questions with useful details, anecdotes and enthusiasm. The conversation only reinforced in my mind that “real MT system development” is something best left to experts, and not something that even large LSPs should dabble with. The reality and complexity of NMT development pushes the limits of MT even further away from the DIY mirage.

In the text below, I have put quotes around everything that I have gotten directly from SYSTRAN material or from Jean Senellart (JAS) to make it clear that I am not interpreting. I have done some minor editing to facilitate readability and “English flow” and added comments in italics within his quotes where this is done.

Read more