# 130 Yasmin Moslem on Using Large Language Models to Custom Train Machine Translation
Author: Slator
Uploaded: 2022-09-02
Views: 694
Description:
Machine Translation (MT) researcher Yasmin Moslem joins SlatorPod to talk about her research on Domain-Specific Text Generation for Machine Translation, a project she conducted with Rejwanul Haque, John D. Kelleher, and Andy Way at the ADAPT Centre in Dublin.
Yasmin shares her experience working as a translator, discovering translation productivity (CAT) tools, and experimenting with translation memory to improve MT. She breaks down the paper’s approach to domain-specific MT training using back-translation for data augmentation.
She discusses how some LSPs are already implementing this approach in real life, customizing it for different use cases. She explains why the researchers used a combination of BLEU, COMET, and other quality evaluation frameworks, as well as human evaluation, to rate machine translation quality.
Yasmin concludes the podcast with her advice for those in the core industry looking to enter the machine translation space, from the spiral learning process to reading research papers.
First up, Florian and Esther discuss the language industry news of the week, including how a streaming platform used proprietary machine dubbing technology for its film offerings in the first quarter of 2022.
Over in London, TransPerfect acquired a virtual data room (VDR) tech company to proactively address the VDR market. In transcription news, VIQ Solutions’ shares dipped by 20% despite the company reporting strong half-year revenue growth of 45% year on year. Meanwhile, multilingual captioning provider Ai-Media turned EBITDA-profitable as a 2021 acquisition drove revenue growth.
Read the paper here: https://arxiv.org/abs/2208.05909
Chapter Markers:
00:00:00 Agenda and Intro
00:02:10 Machine Dubbing in Real Life
00:07:04 TransPerfect Addresses VDR Market
00:12:41 Ai-Media and VIQ Solutions Financial Results
00:17:24 Yasmin Moslem Joins the Pod
00:17:59 Academic and Professional Background
00:23:47 Domain-Specific Text Generation for MT
00:31:38 Medical and Financial Use Cases
00:35:02 Synthetic Text Generation
00:39:41 Application of Machine Translation Research
00:42:34 Real-Life Application by LSPs
00:47:32 Using BLEU as a Quality Measure
00:51:53 Breaking Into the Machine Translation Industry