+ - 0:00:00
Notes for current slide
Notes for next slide

Balance in Bayesian Analysis

Dr. Dogucu

1 / 24

Bechdel Test

Alison Bechdel’s 1985 comic Dykes to Watch Out For has a strip called The Rule where a person states that they only go to a movie if it satisfies the following three rules:

  • the movie has to have at least two women in it;
  • these two women talk to each other; and
  • they talk about something besides a man.

This test is used for assessing movies in terms of representation of women. Even though there are three criteria, a movie either fails or passes the Bechdel test.

2 / 24

Different Priors, Same Data

Let π be the the proportion of movies that pass the Bechdel test.

Below there are three different people with three different priors about π.

optimist clueless feminist
Beta(14,1) Beta(1,1) Beta(5,11)

Plot their priors.

3 / 24

Priors

4 / 24

Vocabulary

Informative prior: An informative prior reflects specific information about the unknown variable with high certainty (ie. low variability).

Vague (diffuse) prior:

A vague or diffuse prior reflects little specific information about the unknown variable. A flat prior, which assigns equal prior plausibility to all possible values of the variable, is a special case.

5 / 24
  • library(fivethirtyeight) has bechdel data frame. Randomly select 20 movies from this dataset (seed = 84735)

  • Based on observed data, update the posterior for all three people. Write the distribution of the posterior.

  • Calculate the summary statistics for the prior and the posterior for all three.

  • Plot the prior, likelihood, and the posterior for all three.

  • Explain the effect of different priors on the posterior.

6 / 24
library(tidyverse)
library(fivethirtyeight)
library(bayesrules)
set.seed(84735)
7 / 24
library(tidyverse)
library(fivethirtyeight)
library(bayesrules)
set.seed(84735)
bechdel_sample <- sample_n(bechdel, 20)
8 / 24
library(tidyverse)
library(fivethirtyeight)
library(bayesrules)
set.seed(84735)
bechdel_sample <- sample_n(bechdel, 20)
count(bechdel_sample, binary)
## # A tibble: 2 x 2
## binary n
## <chr> <int>
## 1 FAIL 11
## 2 PASS 9
9 / 24

The Optimist

summarize_beta_binomial(14, 1, x = 9, n = 20)
## model alpha beta mean mode var
## 1 prior 14 1 0.9333333 1.0000000 0.003888889
## 2 posterior 23 12 0.6571429 0.6666667 0.006258503
10 / 24

The Optimist

plot_beta_binomial(14, 1, x = 9, n = 20)

11 / 24

The Clueless

summarize_beta_binomial(1, 1, x = 9, n = 20)
## model alpha beta mean mode var
## 1 prior 1 1 0.5000000 NaN 0.08333333
## 2 posterior 10 12 0.4545455 0.45 0.01077973
12 / 24

The Clueless

plot_beta_binomial(1, 1, x = 9, n = 20)

13 / 24

The Feminist

summarize_beta_binomial(5, 11, x = 9, n = 20)
## model alpha beta mean mode var
## 1 prior 5 11 0.3125000 0.2857143 0.01263787
## 2 posterior 14 22 0.3888889 0.3823529 0.00642309
14 / 24

The Feminist

plot_beta_binomial(5, 11, x = 9, n = 20)

15 / 24

Comparison

16 / 24

Same Prior, Different Data

Morteza, Nadide, and Ursula – all share the optimistic Beta(14,1) prior for π but each have access to different data. Morteza reviews movies from 1991. Nadide reviews movies from 2000 and Ursula reviews movies from 2013. How will the posterior distribution for each differ?

17 / 24

Morteza's analysis

bechdel_1991 <- filter(bechdel, year == 1991)
count(bechdel_1991, binary)
## # A tibble: 2 x 2
## binary n
## <chr> <int>
## 1 FAIL 7
## 2 PASS 6
6/13
## [1] 0.4615385
18 / 24

Morteza's analysis

plot_beta_binomial(14, 1, x = 6, n = 13)

19 / 24

Nadide's analysis

bechdel_2000 <- filter(bechdel, year == 2000)
count(bechdel_2000, binary)
## # A tibble: 2 x 2
## binary n
## <chr> <int>
## 1 FAIL 34
## 2 PASS 29
29/(34+29)
## [1] 0.4603175
20 / 24

Nadide's analysis

plot_beta_binomial(14, 1, x = 29, n = 63)

21 / 24

Ursula's analysis

bechdel_2013 <- filter(bechdel, year == 2013)
count(bechdel_2013, binary)
## # A tibble: 2 x 2
## binary n
## <chr> <int>
## 1 FAIL 53
## 2 PASS 46
46/(53+46)
## [1] 0.4646465
22 / 24

Ursula's analysis

plot_beta_binomial(14, 1, x = 46, n = 99)

23 / 24

Summary

Sequential Bayes

Consider two new analysts Paola and Mark. Paola starts with Beta(14,1) prior. She first reviews movies from 1971 and updates her belief. Then reviews movies from 1972 and updates her belief. Then reviews movies from 1973 and updates her belief. Make sure to calculate the prior and posterior distribution at each point.

Mark also starts with Beta(14,1) prior. However he reviews movies from 1971, 1972, 1973 all at once. Calculate the posterior.

\section{Data order invariance}

24 / 24

Bechdel Test

Alison Bechdel’s 1985 comic Dykes to Watch Out For has a strip called The Rule where a person states that they only go to a movie if it satisfies the following three rules:

  • the movie has to have at least two women in it;
  • these two women talk to each other; and
  • they talk about something besides a man.

This test is used for assessing movies in terms of representation of women. Even though there are three criteria, a movie either fails or passes the Bechdel test.

2 / 24
Paused

Help

Keyboard shortcuts

, , Pg Up, k Go to previous slide
, , Pg Dn, Space, j Go to next slide
Home Go to first slide
End Go to last slide
Number + Return Go to specific slide
b / m / f Toggle blackout / mirrored / fullscreen mode
c Clone slideshow
p Toggle presenter mode
t Restart the presentation timer
?, h Toggle this help
Esc Back to slideshow