Since then, this legislation has become a cornerstone for business compliance programs. Organizations around the country engage in compliance training to ensure that their employees aren’t found to be acting unethically or illegally when procuring new business in foreign countries.
As part of our webinar series, one of our latest broadcast discussed and demonstrated the unique and innovative Language I/O + SYSTRAN solution, created in collaboration with our partner company Language I/O.
Hosted by J. Obakhan from SYSTRAN and Heather Shoemaker, CEO of Language I/O, the webinar discussed the power of integrating machine language translation technology into the customer care workflow.
Our webinar “Get More From SPNS9” on May 15th, 2020 was a huge success. The webinar demonstrated 6 new exciting upgrades to the SYSTRAN Pure Neural Server 9.6’s, further scaling its technological capabilities. Thank you to those who joined us.
In this post, we have compiled the highlights from the presentation and answers to the questions we receive after.
The minds behind SYSTRAN sit down for an interview regarding the complexities and the capacities of specialized neural machine translation engines.
Participants: Peter Zoldan, Senior Data Engineer -Software Engineer Linguistic Program, Svetlana Zyrianova, Linguistic Program, Petra Bayrami, Jr. Software Engineer – Linguistic Program, Natalia Segal, R&D Engineer.
How much data is required to create a specialized engine?
The more bilingual data, the better the quality. For broad domains such as news, millions of bilingual sentences will be required. However, if the domain is narrow, such as technical support documents for certain products, then even a small set of sentences of 50,000, noticeably improves the quality.
The amount of data required will depend on how broad or narrow the domain is you are specializing the engine into.
Language is messy. Ask any person who has ever had to learn a second language and they will tell you that the most difficult aspect isn’t learning all the rules, but understanding the exceptions to the rules — the real-world application of the language.
When it comes to protecting classified data, blackout redaction has been in use for at least a century. While it is not the only acceptable form of data sanitization, it is historically the oldest and most commonly utilized by eDiscovery firms. This is despite the fact there are more modern and easy-to-use alternatives that save time and reduce errors. The two main data sanitization alternatives that meet legal requirements include anonymization and pseudonymization.
Last month, we conducted a webinar “So, You Think Your Game Is Localized?”, the first of a 3-part-series given by Elizabeth Senouci from XTM International, and Victor Ramirez from SYSTRAN.
If you couldn’t guess by the title, “So, You Think Your Game Is Localized?” was a webinar focused on Video Game Localization. Senouci and Ramirez are both experts on the topic and thus decided to share their knowledge with the video games community.
In the webinar, Senouci and Ramirez discussed the need for game localization, some basic terminologies associated with it, user interfaces, global marketing, and the importance of customer service.
“Localization isn’t just one thing you can do and just get done with it. It’s a holistic process and it’s actually customized based on your game, your product,” Elizabeth said in her intro.
For staff of multinational companies who want to translate a simple phrase or word, systems like Google or Microsoft come in just handy. They help you order a taxi in Japan, pay your restaurant bill in France, and impress your clients with a hearty “jó reggelt” (“good morning”) in Budapest. The problem is such tools are notorious for imprecise translations and data leaks.
Would you really want to use Google Translate for that internal email to your affiliates in another country?
On the other hand, research from the European Parliament shows that on average a common language increases trade flows by 44%. So, how do you – and your staff – hack through language barriers and achieve professional communication in the business world?
Machine Translation users care about quality and performance. Based on our own observations and the feedback we’ve received; the quality of our Neural MT is impressive. Evaluating performance is a stickier subject, but we’d like to dig our hands in and present our innovations and achievements and how it benefits NMT users.
By performance we mostly mean the manner in which a system performs in terms of speed and efficiency in varying production environments. It is important to note that performance and quality in Neural MT are tightly connected: it is easy to accelerate a given model compromising on the quality. Therefore, when evaluating performance improvement, we always check that quality remains very close to optimal quality.
Since switching to NMT at the end of 2016, we’ve invested our R&D efforts into optimizing our engines to be more efficient, while maintaining and even improving translation accuracy. Our latest, 2nd generation NMT engines, available in our latest release of SYSTRAN Pure Neural® Server, implements several technical optimizations that make the translation faster and more efficient.
New model architecture
The first generation of neural translation engines was based on recurrent neural networks (RNN). This architecture requires the source text to be encoded sequentially, word by word, before generating the translation.
Until recently, using machine translation (MT) was considered a hindrance by serious translators. Now that machine translation is powered by artificial intelligence, translators in the government are intrigued by this new technology. Forward-thinking linguist programs recognize the value of MT, and it’s only a matter of time when others will follow suit. Consider these four reasons as motivation for modernizing the status quo:
1. Translate Smarter
As with many other skilled professions – accountants, doctors, analysts – technology is a time-saver. Translators now have the same benefit. In fact, commercial benchmarks show that neural MT helps translators post-edit at 2000 words per hour. Without technology, which is typically the case in the government, translators translate at 300 words per hour. Imagine the time-savings — the same 6000-word document can now be translated in 3 hours instead of 20. Additionally, SYSTRAN MT will retain the formatting of the original document, which further saves time.