WIPO Develops cutting-edge translation tool for Patent Documents
The World Intellectual Property Organization has developed a ground-breaking new “artificial intelligence”-based translation tool for patent documents, handing innovators around the world the highest-quality service yet available for accessing information on new technologies
WIPO Translate now incorporates cutting-edge neural machine translation technology to render highly technical patent documents into a second language in a style and syntax that more closely mirrors common usage, out-performing other translation tools built on previous technologies.
WIPO has initially “trained” the new technology to translate Chinese, Japanese and Korean patent documents into English. Patent applications in those languages accounted for some 55% of worldwide filings in 20141. Users can already try out the Chinese-English translation facility on the public beta test platform.
“One of the aims of the patent system is to make technology available. Language is a barrier to the universal achievement of that aim. This breakthrough for WIPO Translate means that a vast, and ever-increasing, trove of patent documents will soon be more easily accessible to innovators who search these records for inspiration or technical know-how,” said WIPO Director General Francis Gurry. “As part of a global trend, patent applications are increasingly being filed in East Asian languages, particularly in Chinese, and WIPO Translate helps ensure that state-of-the-art knowledge created in these languages is shared as widely and rapidly as possible.”
The high level of accuracy of the Chinese-English translation is the result of the training of the neural machine translation tool, which compared 60 million sentences from Chinese patent documents provided to WIPO’s PATENTSCOPE database by the State Intellectual Property Office of the People’s Republic of China with their translations as filed at the United States Patent and Trademark Office.
WIPO plans to extend the neural machine translation service to French-language patent applications, with other languages to follow. The PATENTSCOPE database integrates with other translation engines freely available on the internet and continues to use existing statistical-based translation technology for languages where it performs well. WIPO has shared its translation software with other international organizations, including the United Nations conference management service, Food and Agriculture Organization, International Telecommunication Union, International Maritime Organization, World Trade Organization, and The Global Fund to Fight AIDS, Tuberculosis and Malaria. “At the United Nations, we are following with great interest the new developments on neural machine translation at WIPO, our partners in this innovative area,” said Cecilia Elizalde, Director of the UN Documentation Division. “Our translation system is integrated with the statistical version of WIPO Translate and translators find it very useful. We look forward to upgrading it to the neural version in the next months.”
Neural machine translation is an emerging technology. It is based on huge neural network models that “learn” from previously translated sentences. The specificity of neural machine translation (compared to previous “phrase based” statistical methods) is that it produces more natural word order, with particular improvements seen in so-called distant language pairs, like Japanese-English or Chinese-English.
In a recent test, WIPO Translate’s neural-based machine translation service substantially out-performed both the previous statistical-based model on distant language pairs, as well as other non-WIPO translation services. Since this WIPO tool is trained and focused uniquely on patent documents, instead of a more-disparate array of texts, it gives higher-quality renderings.
WIPO created its own software, based on open-source software and libraries (Nematus – Theano, AmuNMT), and capitalized on in-house expertise in handling large datasets.
The PATENTSCOPE database provides access to international Patent Cooperation Treaty (PCT) applications in full text format on the day of publication, as well as to patent documents of participating national and regional patent offices. The information may be searched by entering keywords, names of applicants, international patent classification and many other search criteria in multiple languages. The database contains some 58 million records.
WIPO, the World Intellectual Property Organization is the global forum for intellectual property policy, services, information and cooperation. A specialized agency of the United Nations, WIPO assists its 189 member states in developing a balanced international IP legal framework to meet society’s evolving needs. It provides business services for obtaining IP rights in multiple countries and resolving disputes. It delivers capacity-building programs to help developing countries benefit from using IP. And it provides free access to unique knowledge banks of IP information.