Disambiguation – Linking Data Science and Engineering | NLP Summit 2020
Author: John Snow Labs – Healthcare AI Company
Uploaded: 2021-01-07
Views: 636
Description:
Get your free trial of Spark NLP and Spark OCR: https://www.johnsnowlabs.com/spark-nl...
Register for NLP Summit 2021: https://www.nlpsummit.org/2021-events/
Watch all NLP Summit 2020 sessions: https://www.nlpsummit.org/
Disambiguation, or entity linking, is the assignment of a knowledge base identifier (for example, from Wikidata or Wikipedia) to a named entity. Our goal was to improve an MVP model by adding newly created knowledge while maintaining competitive F1 scores.
Taking an entity linking model from MVP to production in a spaCy-native pipeline architecture posed several data science and engineering challenges, such as hyperparameter estimation and knowledge enhancement. We addressed these by using Docker and Kubernetes to semi-automate training as an on-demand job.
We also discuss the lessons we learned and the process improvements needed to balance data science goals against engineering constraints, and we present our current work on improving performance through contextual similarity based on BERT embeddings.
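
To make the idea of similarity-based disambiguation concrete, here is a minimal sketch in plain Python. It is not the pipeline from the talk: the knowledge base, its identifiers (`kb:paris_city`, `kb:paris_mythology`), the three-dimensional vectors, and the `link_entity` helper are all hypothetical stand-ins. In a real system the vectors would come from an embedding model such as BERT, and the candidates from a Wikidata- or Wikipedia-derived knowledge base.

```python
import math
from typing import Dict, List, Optional, Sequence, Tuple

Vector = Sequence[float]


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity between two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


# Hypothetical toy knowledge base: mention text -> candidate entities.
# Each candidate is (identifier, description vector). The identifiers and
# vectors are invented for illustration only.
TOY_KB: Dict[str, List[Tuple[str, Vector]]] = {
    "Paris": [
        ("kb:paris_city", [0.9, 0.1, 0.0]),
        ("kb:paris_mythology", [0.1, 0.9, 0.2]),
    ],
}


def link_entity(
    mention: str,
    context_vec: Vector,
    kb: Dict[str, List[Tuple[str, Vector]]] = TOY_KB,
) -> Optional[str]:
    """Return the candidate ID whose vector is closest to the context vector.

    Returns None when the mention has no candidates in the knowledge base.
    """
    candidates = kb.get(mention, [])
    if not candidates:
        return None
    best_id, _ = max(candidates, key=lambda cand: cosine(context_vec, cand[1]))
    return best_id


if __name__ == "__main__":
    # A context vector that (by assumption) encodes a travel-related sentence.
    print(link_entity("Paris", [0.8, 0.2, 0.1]))
```

The design choice illustrated is that candidate generation (a lookup by mention text) is kept separate from candidate ranking (similarity to the surrounding context), so either part can be improved independently.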