You Ask, I Answer: Hypothesis Formation Without Data Snooping in Marketing Data Science?
Автор: Christopher Penn
Загружено: 2020-04-27
Просмотров: 75
Описание:
Jessica asks, "How would you differentiate hypothesis formation and searching for relevant variables WITHOUT "data snooping"?"
Data snooping, or more commonly known as curve fitting or data dredging, is when you build a hypothesis to fit the data. The way to avoid this is by using evidence not included in the dataset you used to build your hypothesis, which is cross-validation. It's like A/B testing. Most good machine learning tools do this as a best practice, and we should replicate it - they will split a dataset into a training set, a test set, and a validation set. You'll do this best by starting with a sample of your dataset and then adding new data once you've done your initial exploratory data analysis.
Subscribe to my weekly #email newsletter:
http://www.christopherspenn.com/newsl...
Please subscribe to my YouTube channel for more #marketing and #analytics videos!
/ christopherspenn
Need help with your company's #data and #analytics? Let me know:
https://www.trustinsights.ai
Join my free private Slack group, Analytics for #Marketers:
https://www.trustinsights.ai/analytic...
Grab my newest book, AI for Marketers:
http://aiformarketersbook.com
Повторяем попытку...
Доступные форматы для скачивания:
Скачать видео
-
Информация по загрузке: