New PyMCon Talk Released: Missing Value Imputation with Item Response Theory by Allen Downey & Ricardo Vieira

purna135 · December 2, 2023, 5:35pm

Hi Everyone

Come to our next PyMCon Web Series! We’re talking about ‘Missing Value Imputation with Item Response Theory’

Speakers:

Allen Downey, Professor Emeritus at Olin College, and the author of Think Python , Think Bayes , Think Stats and other books related to computer science and data science.
Ricardo Vieira, PyMC developer and data scientist at PyMC Labs

Event type: Recorded Talk with Live Q&A
Q&A Date/Time: 2023-12-15T15:00:00Z(subscribe here for email updates)
Register for Q&A: Meetup event (to get the Zoom link)
Website: PyMCon Events · PyMCon Web Series

Details:

In many large surveys, not every respondent is asked every question, and not every respondent answers the questions they are asked. So, how can we compare people who answer different sets of questions? One solution is to use item response theory (IRT) to impute missing responses—and nothing pairs better with IRT than Bayesian methods!

In this talk, we will report the results of a friendly competition—a bake-off—between two approaches to this problem: one using grid algorithms and a simplified model, the other using PyMC and a more detailed model. We’ll discuss the implementations, compare the results, and outline their pros and cons.

Content:

Async Talk: https://www.youtube.com/watch?v=uyznG61myy0
Interview video: https://www.youtube.com/watch?v=rvIStfmP1NU
Slides: Bayesian Bake-Off: Grids, MCMC, and IRT - Google Slides

Event Format:

Like other PyMCon events, this one features an asynchronous component along with a synchronous Q&A session. Stay tuned for the prerecorded talk; we will be sharing it soon.

There will be a live Q&A on December 15, 2023, at 7:00 pm PT. Register for the event and bring all your doubts to discuss there.

Here is the link:

About the Speaker:

Allen Downey
Allen is a curriculum designer at Brilliant and Professor Emeritus at Olin College, and the author of Think Python, Think Bayes, Think Stats and other books related to computer science and data science.
He writes a blog about Bayesian statistics and related topics called Probably Overthinking It. And he is working on a book, also called Probably Overthinking It, that will be published by University of Chicago Press in 2023. If you would like to get an occasional update about the book, please join my mailing list.
Dr. Mahmood is a neuroscientist with a PhD from Brandeis University, where he investigated the neural coordination of taste. His research, initially using electrophysiology to probe brain region interactions, hints at a complex network processing flavors. His forthcoming studies aim to unravel this network further, exploring the directional flow of neural information and the impact of feedback mechanisms in taste perception.

Connect with Allen:
Website: https://allendowney.substack.com/
LinkedIn: https://www.linkedin.com/in/allendowney/
Twitter: https://twitter.com/AllenDowney
Mastodon: @allendowney@fosstodon.org
Ricardo Vieira
Ricardo Vieira is a PyMC developer and data scientist at PyMC Labs. He spent several years teaching himself Statistics and Computer Science at the expense of his official degrees in Psychology and Neuroscience.

Connect with Ricardo:
Website: Blog | As long as everything adds up to one
GitHub: ricardoV94 (Ricardo Vieira) · GitHub

purna135 · December 8, 2023, 10:06pm

The full interview with Allen and Ricardo is now available on our YouTube channel, where they share insights into their statistical and PyMC journeys, along with some invaluable advice.

Check out the interview now, and stay tuned for their asynchronous talk tomorrow.

purna135 · December 10, 2023, 3:25pm

The Async talk and slides for this PyMCon event are now live on our YouTube channel!

Watch the Async Talk Now:

Explore the Slides:

In case you haven’t registered for the Q&A session yet, do RSVP now to get the Zoom link:

Save the date, watch the talk, and come prepared with your questions on December 15th! Let’s make it an insightful and engaging session!

foabodo · December 15, 2023, 3:09pm

Question for the 12/15/2023 Q&A: I’d like to know your opinion about the feasibility of the following scenario. Imagine we have 100 students with test scores from Physics I, Physics II, and Nuclear Physics. We also have some demographic data. And 400 students with test scores from Physics I and Physics II. I want to predict the Nuclear Physics test scores for the 400 students using the methods presented in the async talk. Am I asking too much?

RavinKumar · December 15, 2023, 3:27pm

This was the second question in the chat

I’m trying to predict the survival of ropes and I can measure things such as the force applied on the rope and time the rope is in service. However, one of the things that determines the life of the rope would be abrasions which I can’t measure. Can I input that missing information into a model or is that too much

foabodo · December 15, 2023, 6:42pm

Is anyone aware of public data sets that are well suited for use in exploring probabilistic IRT?

purna135 · December 20, 2023, 6:48pm

If you missed our live Q&A session the recording is now available on YouTube:

foabodo · February 15, 2024, 2:05am

I found a near-ideal dataset in this paper! Essay scoring by multiple graders per student with many tens of thousands of data points.

Topic		Replies	Views
[Online Meetup] Bayesian Item Response Modelling in PyMC (October 26, 2022) community	0	381	October 20, 2022
Collaborate on a PyMCon talk? PyMCon Web Series	3	454	August 21, 2023
Partial Missing Multivariate Observation and What to Do With Them by Junpeng Lao PyMCon2020	4	1727	October 31, 2020
Missing values in a model? Questions	12	4729	November 7, 2018
Dealing with missing data and custom distribution Questions	13	2182	March 14, 2021