New PyMCon Talk Released: Bayesian Causal Modeling by Thomas Wiecki & Ben Vincent

purna135 · September 15, 2023, 12:49pm

Welcome to the 10th event of the PyMCon Web Series! As part of this series, most events will have both an asynchronous component and a live Q&A.

Speaker: Thomas Wiecki, CEO & founder of PyMC Labs.
Event type: Recorded Talk with Live Q&A
Q&A Date/Time: 2023-09-28T13:00:00Z (subscribe here for email updates)
Register for Q&A: Meetup event (to get the Zoom link)
Website: PyMCon Events · PyMCon Web Series

NOTE: This session is exclusively for Q&A. We kindly request that you watch the recording before joining the event. Plus, The event will be recorded. Subscribe to the PyMC YouTube channel for notifications.

Abstract of the talk:

Causal analysis is rapidly gaining popularity, but why? Machine learning methods might help us predict what’s going to happen with great accuracy, but what’s the value of that if it doesn’t tell us what to do to achieve a desirable outcome? Without a causal understanding of the world, it’s often impossible to identify which actions lead to a desired outcome.

Causal analysis is often embedded in a frequentist framework, which comes with some well-documented baggage. In this talk, Thomas will present how we can super-charge PyMC for Bayesian Causal Analysis by using a powerful new feature: the do operator.

Content

Slides:

Code:

About the Speaker:

Dr. Thomas Wiecki
Dr. Thomas Wiecki is an author of PyMC, the leading platform for statistical data science. To help businesses solve some of their trickiest data science problems, he assembled a world-class team of Bayesian modelers and founded PyMC Labs – the Bayesian consultancy. He did his PhD at Brown University, studying cognitive neuroscience.

Connect with Thomas:
Website: http://www.pymc-labs.com/
Twitter: https://twitter.com/twiecki
GitHub: twiecki (Thomas Wiecki) · GitHub

Sponsor

We thank our sponsors for supporting PyMC and the PyMCon Web Series. If you would like to sponsor us, contact us for more information.

Adia Lab is an independent, Abu Dhabi-based laboratory dedicated to basic and applied research in data and computational sciences.
ADIA Lab focuses on societally-important topics such as climate change and energy transition, blockchain technology, financial inclusion and investing, decision making, automation, cybersecurity, health sciences, education, telecommunications, and space, by conducting cutting-edge research in Data Science, Artificial Intelligence, Machine Learning, and High-Performance Computing.

Galen_Seilis · September 28, 2023, 1:09pm

Does the do-operator in PyMC work with time series? If I intervene on a state at time t, does this propagate through to later states in the model?

JAB · September 28, 2023, 1:24pm

In your example, (if I recall correctly) you implemented the do(z) operator with a binary variable. How does this change when z is a continuous variable? I believe this may be relevant to your hello fresh example, but I am not sure if I am making the connection.

Galen_Seilis · September 28, 2023, 1:36pm

What changes to Bayesian workflow (Gelman et al 2020) should we consider making when we’re also trying to do “causal workflow”?

olivares-j · September 28, 2023, 1:42pm

Hi, super cool talk. I would like to know, When a node is fixed with the do operator into a certain value, What happens with the parent nodes (the ones above in the hierarchy)? Are those sampled from the prior or simple not used in the model anymore? Thanks!

ricardoV94 · September 28, 2023, 4:36pm

You would probably need to create a new predictive model to intervene more granularly inside a time-series (so manually, instead of using do to do that for you). Something like the forecasting examples in this blogpost: Out of model predictions with PyMC - PyMC Labs

ricardoV94 · September 28, 2023, 4:38pm

If they are not connected to the likelihood through any other path, then yes, they will be sampled from the prior (in prior and posterior sampling). There is a kwarg in do to remove such variables: prune_vars=True.

https://www.pymc.io/projects/docs/en/stable/api/model/transform/generated/pymc.model.transform.conditioning.do.html

ricardoV94 · September 28, 2023, 4:40pm

The type of variable being intervened upon does not change the process. Do you have a specific concern in mind?

JAB · September 29, 2023, 1:37pm

EDIT: I replied to the wrong spot so I am moving it to the correct reply.

Hi Ricardo,

Thomas was able to answer this in the discussion. He pointed out that we get so used to thinking about distributions that we forgot about the concept of point estimates altogether . He pointed out that it does not matter if z is discrete or continuous; however, we do need to remember that do(z) will be a constant value. That clarified the issue for me.

The second question I asked was more clarification about do vs observe in time series problems. I was wondering if we can treat observe as the do operator if we just replace our data with a simulated time series using observe. He pointed out that, no, we cannot do that because observe just replaces the data while maintaining the graph structure, whereas do breaks the graph structure to isolate that point of intervention. That helped clarify the difference between these two operators for me.

Thanks all, this was great!

purna135 · October 2, 2023, 9:29am

If you couldn’t attend the live Q&A session, you can watch the recording on YouTube. Here’s the link:

Topic		Replies	Views
🚀 PyMC 5.8.0: What's New? News	0	452	September 12, 2023
Distributional do-operator? v5 modeling	1	496	September 16, 2023
Do Operator not working correctly with deterministic function v5 bug , modeling	1	233	November 15, 2023
Bayesian causal inference Questions	1	1493	March 27, 2020
Do operator on multiple variables in model v5 modeling	6	487	October 17, 2023

New PyMCon Talk Released: Bayesian Causal Modeling by Thomas Wiecki & Ben Vincent

Abstract of the talk:

Content

About the Speaker:

Sponsor

Related topics