Data Skeptic

Have you ever wondered what goes on under the hood when you accept a website’s cookies? Today, Maximilian Hils, a PhD student in Computer Science, at the University of Innsbruck, Austria, dissects the ad tech industry and the standards put in place to protect users’ data. He also shares his thoughts on the use of VPNs as well as other tools that help shield your data from prying eyes on the internet.

Click here for additional show notes

Thanks to our sponsor:
https://clear.ml/ ClearML is an open-source MLOps solution users love to customize, helping you easily Track, Orchestrate, and Automate ML workflows at scale.

Direct download: privacy-preference-signals.mp3
Category:general -- posted at: 6:00am PDT

Ravi Krishna joins us today to talk about his recent work on a differentiable NAS framework for ads CTR prediction. He discussed what CTR prediction is about and why his NAS framework helps in building neural networks for better ads recommendation. Listen to learn about methodology, related literature and his results.

Click for additional show notes

Thanks to our sponsor:
https://astrato.io Astrato is a modern BI and analytics platform built for the Snowflake Data Cloud. A next-generation live query data visualization and analytics solution, empowering everyone to make live data decisions.

Direct download: neural-architecture-search-for-ctr-prediction.mp3
Category:general -- posted at: 8:19am PDT

Effectively managing a large budget of pay per click advertising demands software solutions. When spending multi-million dollar budgets on hundreds of thousands of keywords, an effective algorithmic strategy is required to optimize marketing objectives.

In this episode, Nathan Janos joins us to share insights from his work in the ad tech industry.

Click for additional show notes

Thanks to our sponsor!
https://wandb.com/ The developer-first MLOps platform. Build better models faster with experiment tracking, dataset versioning, and model management.

Direct download: algorithmic-ppc-management.mp3
Category:general -- posted at: 3:10pm PDT

Increasingly, people get most if not all of the information they consume online. Alongside the web sites, videos, apps, and other destinations, we’re consistently served advertisements alongside the organic content we search for or discover. Targetted ads make it possible for you to discover relevant new products you might otherwise not have heard about. Targetting can also open a pandora’s box of ethical considerations. Online advertising is a complex network of automated systems. Algorithms controlling algorithms controlling what we see.

This season of Data Skeptic will focus on the applications of data science to digital advertising technology. In this first episode in particular, Kyle shares some of his own personal experiences and insights working in pay-per-click marketing.

Click for additional show notes

 

 

Direct download: ad-tech.mp3
Category:general -- posted at: 8:44pm PDT

Our mobile phones generate an incredible amount of data inbound and outbound. In today’s episode, Nishant Kishore, a PhD graduate of Harvard University in Infectious Disease Epidemiology, explains how mobility data from mobile phones can be captured and analysed to understand the spread of infectious diseases.

Click here for additional show notes

Thanks to our sponsor!
https://neptune.ai/ Log, store, query, display, organize, and compare all your model metadata in a single place

Direct download: the-reliability-of-mobile-phone-data.mp3
Category:general -- posted at: 10:31pm PDT

The pandemic changed how we lived. And this had a ripple effect on the performance of machine learning models. Ravi Parikh joins us today to discuss how the pandemic has affected the performance of machine learning models in clinical care and some actionable steps to fix it.

Click here for additional show notes

Thanks to our sponsor:
Astera Centerprise is a no-code data integration platform that allows users to build ETL/ELT pipelines for modern data warehousing and analytics.

Direct download: haywire-algorithms.mp3
Category:general -- posted at: 6:00am PDT

Carly Lupton-Smith joins us today to speak about her research which investigated the consistency between household and county measures of school reopening. Carly is a doctoral researcher in Biostatistics at Johns Hopkins Bloomberg School of Public Health. Listen to know about her findings.

Click here for additional show notes on our website!

Thanks to our sponsor!
ClearML is an open-source MLOps solution users love to customize, helping you easily Track, Orchestrate, and Automate ML workflows at scale.

Astera Centerprise is a no-code data integration platform that allows users to build ETL/ELT pipelines for modern data warehousing and analytics.

 

Direct download: school-reopening-analysis.mp3
Category:general -- posted at: 7:00am PDT

Today, we are joined by Alexander Thor, a Product Manager at Vizlib, makers of Astrato. Astrato is a data analytics and business intelligence tool built on the cloud and for the cloud. Alexander discusses the features and capabilities of Astrato for data professionals.

Visit our website for additional show notes!

 

Direct download: modern-data-stacks.mp3
Category:general -- posted at: 7:00am PDT

Emojis are arguably one of the most effective ways to express emotions when texting. In today’s episode, Xuan Lu shares her research on the use of emojis by developers. She explains how the study of emojis can track the emotions of remote workers and predict future behavior. Listen to find out more!

Direct download: emoji-as-a-predictor.mp3
Category:general -- posted at: 7:25am PDT

On the show today, Fabian Braesemann, a research fellow at the University of Oxford, joins us to discuss his study analyzing the gig economy. He revealed the trends he discovered since remote work became mainstream, the factors causing spatial polarization and some downsides of the gig economy. Listen to learn what he found. 
Direct download: polarizing-trends-in-the-gig-economy.mp3
Category:general -- posted at: 6:39am PDT