ycliper

Популярное

Музыка Кино и Анимация Автомобили Животные Спорт Путешествия Игры Юмор

Интересные видео

2025 Сериалы Трейлеры Новости Как сделать Видеоуроки Diy своими руками

Топ запросов

смотреть а4 schoolboy runaway турецкий сериал смотреть мультфильмы эдисон
Скачать

Fuzzy String Matching in R | Overview and R Tutorial (Using fuzzywuzzy, polyfuzz, and difflib)

Автор: Melissa Van Bussel (ggnot2)

Загружено: 2022-03-28

Просмотров: 9182

Описание: In today's video, we'll learn about fuzzy string matching (also known as approximate string matching) and how to perform it in R.

A common use case for fuzzy string matching is when we want to join two datasets. Perhaps these datasets have a variable in common, but the information in one dataset is expressed slightly differently than the information in the other (e.g., “Amazon” vs. “Amazon.com, Inc”). How can we determine if these two variables are referring to the same thing? We can use fuzzy string matching, a popular Natural Language Processing (NLP) technique!

We'll start with a conceptual overview of fuzzy string matching, and then look at some examples in R using several different algorithms. We’ll use fuzzywuzzy, polyfuzz, and difflib – currently the most popular packages for performing this task. Among others, some of the string matching algorithms that are implemented in these packages include Levenshtein Distance (sometimes called "Edit Distance") and Gestalt Pattern Matching (sometimes called "Ratcliff/Obershelp Pattern Matching").

The code, slides, and dataset used in this video can be found here: https://github.com/melissavanbussel/Y...

The dataset originated from Kaggle: https://www.kaggle.com/code/leandrodo...

The blog post about PolyFuzz referenced in the video is located here: https://towardsdatascience.com/string...

If you like this video, please subscribe to my channel so that I can continue to make content like this! 😊

0:00 - Overview of fuzzy string matching
3:49 - Fuzzy string matching in R
9:53 - Using the difflib package
16:32 - Using the fuzzywuzzy package
19:58 - Using the polyfuzz package

Не удается загрузить Youtube-плеер. Проверьте блокировку Youtube в вашей сети.
Повторяем попытку...
Fuzzy String Matching in R | Overview and R Tutorial (Using fuzzywuzzy, polyfuzz, and difflib)

Поделиться в:

Доступные форматы для скачивания:

Скачать видео

  • Информация по загрузке:

Скачать аудио

Похожие видео

Fuzzy Matching in R (Example) | Approximate String, Name & Text Search | adist(), agrep() & amatch()

Fuzzy Matching in R (Example) | Approximate String, Name & Text Search | adist(), agrep() & amatch()

R-Ladies Baltimore | Reticulate RStudio and your R code with Python | Timbers

R-Ladies Baltimore | Reticulate RStudio and your R code with Python | Timbers

How Fuzzy Text Search Works

How Fuzzy Text Search Works

Нечеткий регулятор и его реализация на языке C / How to implement fuzzy logic control in c

Нечеткий регулятор и его реализация на языке C / How to implement fuzzy logic control in c

Программа «Статус» с Екатериной Шульман и Максимом Курниковым | 20.01.2026

Программа «Статус» с Екатериной Шульман и Максимом Курниковым | 20.01.2026

Fuzzy String Matching in Python

Fuzzy String Matching in Python

How to create a custom classification model in R using the openai package (fine-tune tutorial)

How to create a custom classification model in R using the openai package (fine-tune tutorial)

Record Linkage: Probabilistic Matching

Record Linkage: Probabilistic Matching

Webscraping in R

Webscraping in R

Exploring NLP Fuzzy Matching Algorithms

Exploring NLP Fuzzy Matching Algorithms

How to Create an Interactive Infographic Using R (Displayr Tutorial)

How to Create an Interactive Infographic Using R (Displayr Tutorial)

Dplyr Essentials (easy data manipulation in R): select, mutate, filter, group_by, summarise, & more

Dplyr Essentials (easy data manipulation in R): select, mutate, filter, group_by, summarise, & more

Python Text Fuzzy Search Tutorial | RapidFuzz FuzzyWuzzy Alternative

Python Text Fuzzy Search Tutorial | RapidFuzz FuzzyWuzzy Alternative

Fuzzy Matching for Text Strings in Spotfire

Fuzzy Matching for Text Strings in Spotfire

Modeling hotel bookings in R using tidymodels and recipes

Modeling hotel bookings in R using tidymodels and recipes

Hadley Wickham: Managing many models with R

Hadley Wickham: Managing many models with R

Обработка данных с помощью R за 27 минут

Обработка данных с помощью R за 27 минут

ОБЫЧНЫЙ VPN УМЕР: Чем обходить блокировки в 2026

ОБЫЧНЫЙ VPN УМЕР: Чем обходить блокировки в 2026

Tidyverse in R - tips & tricks

Tidyverse in R - tips & tricks

Fuzzy string matching using Python

Fuzzy string matching using Python

© 2025 ycliper. Все права защищены.



  • Контакты
  • О нас
  • Политика конфиденциальности



Контакты для правообладателей: [email protected]