ycliper

Популярное

Музыка Кино и Анимация Автомобили Животные Спорт Путешествия Игры Юмор

Интересные видео

2025 Сериалы Трейлеры Новости Как сделать Видеоуроки Diy своими руками

Топ запросов

смотреть а4 schoolboy runaway турецкий сериал смотреть мультфильмы эдисон
Скачать

R Tutorial: What are survey weights?

Автор: DataCamp

Загружено: 2020-03-10

Просмотров: 29086

Описание: Want to learn more? Take the full course at https://learn.datacamp.com/courses/an... at your own pace. More than a video, you'll learn hands-on coding & quickly apply skills to your daily work.



---


Hi! I am Kelly McConville, a survey statistician, and professor. Welcome to my course on analyzing survey data.


Now I am wondering if you have ever found yourself in the following situation: You have a question you want to answer. You found a great dataset to answer that question and then there’s this column in the dataset that represents survey weights. And you ask yourself: What are those? Can I ignore those?


Well, let’s pretend we have found ourselves in this situation. We want to estimate the average household income in the US. We find that the Bureau of Labor Statistics provides a public use dataset. And, this dataset includes the variable FINCBTAX, given in the second column here, which is the amount of household income before taxes in 2016. But the first column in the dataset is a survey weight variable, FINLWT21. How should these weights impact our analyses?


First, we should ask: what are survey weights? Survey weights result from data that were collected under a complex sampling design. The weights tell us the number of individuals in the population that each sampled individual represents.


Returning to the BLS sample, the first weight equals 25,985, which means that the first sampled household in the dataset represents 25,985 households in the population. The second represents 6,581 households.


Now that we know what survey weights are, the question remains: How will they impact our analyses?


Let’s consider a common goal for survey data: to estimate a population quantity. Suppose this picture reflects all households in the US where each green box is an individual household.


And we want to estimate the average household income. Then y_i is the income for the ith household, U represents all US households, and capital N is the total number of households. Then, the fancy notation can be read to say that mu, the average household income, equals the sum of all the incomes, divided by the total number of households.


Of course, we can only calculate mu if we have income data for every household.


But we don’t. Instead, BLS takes a sample of households, represented by the blue squares, using a complex sampling design. We will call that sample s. They only collect income data for the n households in the sample.


Now to estimate mu, we can calculate the sample average, which is called y-bar. y-bar is the average income for the households in BLS's sample.


To calculate the sample mean for the BLS survey, we must insert the income variable, FINCBTAX, from the Consumer Expenditure dataset, denoted by ce, into the mean() function. Remember we can call a variable using the syntax dataset$variable_name. The average household income for the Consumer Expenditure sample is $62,480. Is this a good estimate of the average income of ALL US households?


Probably not. The problem is that the sample mean assumes all households in the sample represent the same number of households in the population. But when we looked at the survey weights, we learned that just isn't true!


And remember, for the sampled households, we have both the income data and the survey weights. To properly estimate the mean income, we need to use both when constructing our estimator.


But how do I incorporate the sampling design into my estimates, my data visualizations, my models? Well, that's exactly what we will learn to do in this course.


But first, let's practice exploring the weights themselves.


#DataCamp #RTutorial #AnalyzingSurveyDatainR #AnalyzingSurveyData

Не удается загрузить Youtube-плеер. Проверьте блокировку Youtube в вашей сети.
Повторяем попытку...
R Tutorial: What are survey weights?

Поделиться в:

Доступные форматы для скачивания:

Скачать видео

  • Информация по загрузке:

Скачать аудио

Похожие видео

What is Survey Weights? by Natalie Shlomo

What is Survey Weights? by Natalie Shlomo

Выучите R за 39 минут

Выучите R за 39 минут

W7: Using survey weights in R

W7: Using survey weights in R

Survey Data Analysis: NHANES sampling, survey features, weights, inference, variance, subpopulation

Survey Data Analysis: NHANES sampling, survey features, weights, inference, variance, subpopulation

Deep dive into health survey data analysis for R using DHS and NHANES

Deep dive into health survey data analysis for R using DHS and NHANES

Structural Equation Modeling: what is it and what can we use it for? (part 1 of 6)

Structural Equation Modeling: what is it and what can we use it for? (part 1 of 6)

What are confidence intervals? Actually.

What are confidence intervals? Actually.

Part II: Introduction of Principles of DHS Sampling Weights

Part II: Introduction of Principles of DHS Sampling Weights

SBE CCC: Using weights when analyzing survey data: Descriptive Statistics vs. Regression Modeling

SBE CCC: Using weights when analyzing survey data: Descriptive Statistics vs. Regression Modeling

R-Ladies RTP (English) - Tidy Data, Weighted Insights: Analyzing Complex Survey Data in R

R-Ladies RTP (English) - Tidy Data, Weighted Insights: Analyzing Complex Survey Data in R

Analyzing Categorical Data from the General Social Survey in Python

Analyzing Categorical Data from the General Social Survey in Python

Complex Survey Designs and Weighting Using Stata: Part 1

Complex Survey Designs and Weighting Using Stata: Part 1

4 Hours Chopin for Studying, Concentration & Relaxation

4 Hours Chopin for Studying, Concentration & Relaxation

Questionnaire Data Preparation in R

Questionnaire Data Preparation in R

Webinar: Preparing Survey Data in Tableau Desktop

Webinar: Preparing Survey Data in Tableau Desktop

Открытие Варбурга: 4 переключателя, которые мешают раку расти | Здоровье с Доктором

Открытие Варбурга: 4 переключателя, которые мешают раку расти | Здоровье с Доктором

Sade - Ultimate

Sade - Ultimate

RStudio Tutorial for Beginners: Introduction to R Studio and Basics of R

RStudio Tutorial for Beginners: Introduction to R Studio and Basics of R

The Essential Main Ideas of Neural Networks

The Essential Main Ideas of Neural Networks

Part IV: Demonstration of How to Weight DHS Data in SPSS & SAS

Part IV: Demonstration of How to Weight DHS Data in SPSS & SAS

© 2025 ycliper. Все права защищены.



  • Контакты
  • О нас
  • Политика конфиденциальности



Контакты для правообладателей: [email protected]