What is the probability Harris wins? Building a Statistical Model.

By . Updated: (first published )

After Joe Biden made the historic decision to exit the presidential race, the country no longer has to choose between two octogenarians. It's a new exciting time with lots of uncertain questions. In this article we'll try to see just how uncertain we are, and build a probabilistic model of whether Kamala Harris will win if nominated.

There are lots of online election forecasts (e.g., from Silver et al., Morris et al., Gelman et al., among others). So why build another one? In a previous article I created an election model to help understand if Biden should drop out (focusing on Gretchen Whitmer as a replacement). That had a motivation and rhetorical purpose. I partially made a version for Harris, but didn't publish before Biden dropped out. So while this Harris model is now mostly recreational, it has some potential contributions for those interested in election modeling:

  • Transparency - Each part tries to include links to corresponding source code
  • Analysis and visualization of the amount of available polling (Section 1.2). This is to model the potentially limited initial Harris polling.
  • Alternative approach/presentation of errors and poll movement (Section 2)
  • Timeliness/impatience - At time of writing, prominent modelers like FiveThirtyEight, Silver, and the Economist have/had not released a version for Harris yet. This gives one initial estimate.

For those just looking for a number, here's jumping ahead for the top-line:

This estimate should automatically update four times a day with the newest polling data. For reference, here's archived past versions.

JulAugSepOctNov
21πŸ‘΄πŸ»
28
42%
4
11
18
25
1
8
15
22
29
6
13
20
27
3
22
29
43%
5
12
19
26
2
9
16
23
30
7
14
21
28
4
23
30
47%
6
13
20
27
3
10
17
24
1
8
15
22
29
5πŸ—³οΈ
24
31
49%
7
14
21
28
4
11
18
25
2
9
16
23
30
25
1
49%
8
15
22
29
5
12
19
26
3
10
17
24
31
26
44%
2
50%
9
16
23
30
6
13
20
27
4
11
18
25
1
27
44%
3
49%
10
17
24
31
7
14
21
28
5
12
19
26
2

We'll go through each of the steps to reach this estimate. The general approach follows that of other polls-based models. We create an average of polls in each swing state, estimate the expected polling miss, and then estimate how polls might move. Then we run thousands of random simulationsto get a fractionwhere each candidate wins.

Where are Polls Today

Gathering Data

The first step is gathering polling data (sourced via FiveThirtyEight).

Not all pollsters are equal, with some pollsters having a better track record. Thus, we weight each poll. Our weighting is intended to be scaled where 1.0 is the value of a poll from a top-rated pollster (eg, Siena/NYT, Emerson College, Marquette University, etc.) that interviewed their sample yesterday or sooner.

Less reliable/transparent pollsters are weighted as some fraction of 1.0. Additionally, older polls are weighted lessWeight decays with an approximate half-life of 9 days. We use between the start and end date, and wait 1.5 days to start decay.. Polls before Biden dropped out carry ΒΌ weight. This function is only a heuristic estimate (see codefor exact definition).

If a pollster reports multiple numbers (eg, with or without RFK Jr., registered voters or likely voters, etc), we use the version with the largest sum covered by the Democrat and Republican.

National Polls
WeightPollster (rating)DatesHarris: TrumpHarris Share
0.76YouGov (2.9)07/27-07/3046% : 44%51.1
0.61Ipsos (2.8)07/26-07/2843% : 42%50.6
0.55RMG Research (2.3)07/29-07/3147% : 42%52.8
0.53Civiqs (2.5)07/27-07/3049% : 45%52.1
0.51Ipsos (2.8)07/24-07/2538% : 38%50.0
0.49Siena/NYT (3.0)07/22-07/2447% : 48%49.5
0.49AtlasIntel (2.7)07/23-07/2548% : 50%48.9
0.47YouGov (2.9)07/22-07/2344% : 46%48.9
0.43Ipsos (2.8)07/22-07/2344% : 42%51.2
0.42Marist (2.9)07/22-07/2245% : 46%49.5
0.30RMG Research (2.3)07/22-07/2346% : 48%48.9
0.30Leger (2.0)07/26-07/2848% : 41%53.9
0.28Morning Consult (1.8)07/26-07/2847% : 46%50.5
0.27Redfield & Wilton Strategies (1.8)07/29-07/2945% : 43%51.1
0.26Morning Consult (1.8)07/25-07/2747% : 45%51.1
0.25Angus Reid (2.0)07/23-07/2544% : 42%51.2
0.24Morning Consult (1.8)07/24-07/2646% : 45%50.5
0.23HarrisX/Harris Poll (1.6)07/26-07/2848% : 52%48.0
0.22Morning Consult (1.8)07/23-07/2546% : 45%50.5
0.21American Pulse (?)07/26-07/2950% : 48%50.6
0.21Morning Consult (1.8)07/22-07/2446% : 45%50.5
0.20ActiVote (?)07/24-07/2949% : 51%48.7
0.20Big Data Poll (?)07/26-07/2849% : 51%49.4
0.20SurveyMonkey (1.9)07/22-07/2438% : 39%49.4
0.19Clarity (1.6)07/22-07/2648% : 47%50.5
0.19Florida Atlantic University/Mainstreet Research (?)07/26-07/2748% : 46%51.1
0.18HarrisX (1.6)07/22-07/2549% : 51%49.0
0.16Fabrizio/GBAO (?)07/23-07/2547% : 49%49.0
0.15Big Village (1.6)07/22-07/2443% : 44%49.1
0.11YouGov (2.9)07/21-07/2341% : 44%48.2
0.11Change Research (1.4)07/22-07/2444% : 43%50.6
0.10YouGov (2.9)07/19-07/2246% : 46%50.0
0.09Echelon Insights (2.7)07/19-07/2147% : 49%49.0
0.08Quinnipiac (2.8)07/19-07/2147% : 49%49.0
0.08YouGov (2.9)07/16-07/1848% : 51%48.5
0.06YouGov (2.9)07/13-07/1639% : 44%47.0
0.06Ipsos (2.8)07/15-07/1644% : 44%50.0
0.06SurveyUSA (2.8)07/12-07/1542% : 45%48.3
0.05Morning Consult (1.8)07/21-07/2245% : 47%48.9
0.04Marist (2.9)07/09-07/1050% : 49%50.5
0.04Beacon/Shaw (2.8)07/07-07/1048% : 49%49.5
0.04YouGov (2.9)07/07-07/0938% : 42%47.5
0.04Emerson (2.9)07/07-07/0843% : 49%46.6
0.03ActiVote (?)07/21-07/2350% : 50%49.5
0.03Ipsos (2.8)07/05-07/0949% : 47%51.0
0.03HarrisX (1.6)07/19-07/2147% : 53%47.0
0.03McLaughlin (0.5)07/23-07/2945% : 47%48.9
0.03Hart/POS (2.6)07/07-07/0945% : 47%48.9
0.03Noble Predictive Insights (2.4)07/08-07/1144% : 48%47.8
0.03SoCal Research (?)07/21-07/2143% : 51%45.7
0.03Morning Consult (1.8)07/15-07/1545% : 46%49.5
0.03Florida Atlantic University/Mainstreet Research (?)07/19-07/2144% : 49%47.1
0.02SoCal Research (?)07/17-07/1744% : 52%45.8
0.02Ipsos (2.8)07/01-07/0242% : 43%49.4
0.02HarrisX (1.6)07/13-07/1548% : 52%48.0
0.02YouGov (2.9)06/28-07/0145% : 47%48.9
0.02Data for Progress (2.7)06/28-06/2845% : 48%48.4
0.02Big Village (1.6)07/12-07/1437% : 42%47.3
0.01Manhattan Institute (?)07/07-07/1346% : 48%48.9
0.01Redfield & Wilton Strategies (1.8)07/08-07/0837% : 44%45.7
0.01J.L. Partners (1.6)07/01-07/0338% : 49%43.7
0.01Split Ticket/Data for Progress (?)07/01-07/0346% : 46%50.0
0.01HarrisX (1.6)06/28-06/3047% : 53%47.0
0.01CNN/SSRS (?)06/28-06/3045% : 47%48.9
0.00Bendixen & Amandi International (1.0)07/02-07/0642% : 41%50.6
Sum 11.0TotalAvg 50.2
Pennsylvania
WeightPollster (rating)DatesHarris: TrumpHarris Share
0.92From Natl. Avg. (0.91β‹…x + 3.62)49.3
0.49Beacon/Shaw (2.8)07/22-07/2449% : 49%50.0
0.46Emerson (2.9)07/22-07/2349% : 51%48.9
0.33Susquehanna (2.3)07/22-07/2847% : 43%52.2
0.26Morning Consult (1.8)07/24-07/2846% : 50%47.9
0.19PPP (1.4)07/29-07/3047% : 48%49.5
0.17Public Opinion Strategies (1.7)07/23-07/2948% : 45%51.6
0.17Quantus Polls and News (?)07/27-07/2846% : 48%49.4
0.17Redfield & Wilton Strategies (1.8)07/22-07/2442% : 46%47.7
0.15Bullfinch (?)07/23-07/2548% : 47%50.5
0.04Siena/NYT (3.0)07/09-07/1147% : 48%49.5
0.04Civiqs (2.5)07/13-07/1644% : 46%48.9
0.03InsiderAdvantage (2.0)07/15-07/1640% : 47%46.0
0.02SoCal Research (?)07/20-07/2146% : 50%47.9
0.02North Star Opinion Research (1.2)07/20-07/2345% : 47%48.9
0.02PPP (1.4)07/17-07/1843% : 45%48.9
0.01PPP (1.4)07/11-07/1245% : 51%46.9
Sum 3.5TotalAvg 49.6
Wisconsin
WeightPollster (rating)DatesHarris: TrumpHarris Share
0.81From Natl. Avg. (0.91β‹…x + 3.74)49.5
0.49Beacon/Shaw (2.8)07/22-07/2449% : 50%49.5
0.46Emerson (2.9)07/22-07/2351% : 49%50.6
0.25Morning Consult (1.8)07/24-07/2849% : 47%51.0
0.17Public Opinion Strategies (1.7)07/23-07/2948% : 46%51.1
0.15Redfield & Wilton Strategies (1.8)07/22-07/2444% : 44%50.0
0.04Civiqs (2.5)07/13-07/1648% : 48%50.0
0.01PPP (1.4)07/10-07/1148% : 49%49.5
0.01North Star Opinion Research (1.2)07/06-07/1047% : 48%49.5
Sum 2.4TotalAvg 50.0
Georgia
WeightPollster (rating)DatesHarris: TrumpHarris Share
0.91From Natl. Avg. (0.71β‹…x + 11.40)47.0
0.45Emerson (2.9)07/22-07/2349% : 51%48.9
0.26Morning Consult (1.8)07/24-07/2847% : 47%50.0
0.21Landmark Communications (2.1)07/22-07/2247% : 48%49.3
0.20PPP (1.4)07/29-07/3048% : 47%50.5
0.18Trafalgar Group/InsiderAdvantage (?)07/29-07/3047% : 49%49.0
0.17Redfield & Wilton Strategies (1.8)07/22-07/2442% : 47%47.2
0.15SoCal Research (?)07/25-07/2646% : 50%48.2
0.04U. Georgia SPIA (2.2)07/09-07/1846% : 50%47.6
0.03InsiderAdvantage (2.0)07/15-07/1637% : 47%43.8
0.01Florida Atlantic University/Mainstreet Research (?)07/14-07/1544% : 49%47.3
0.01Florida Atlantic University/Mainstreet Research (?)07/12-07/1343% : 49%46.7
Sum 2.6TotalAvg 48.2
Michigan
WeightPollster (rating)DatesHarris: TrumpHarris Share
0.90From Natl. Avg. (1.13β‹…x + -7.34)49.6
0.49Beacon/Shaw (2.8)07/22-07/2449% : 49%50.0
0.45Emerson (2.9)07/22-07/2349% : 51%49.1
0.25Morning Consult (1.8)07/24-07/2853% : 42%55.8
0.17Public Opinion Strategies (1.7)07/23-07/2945% : 45%50.0
0.15SoCal Research (?)07/25-07/2646% : 49%48.4
0.15Redfield & Wilton Strategies (1.8)07/22-07/2441% : 44%48.2
0.14Glengariff Group Inc. (1.5)07/22-07/2442% : 41%50.2
0.04Civiqs (2.5)07/13-07/1646% : 46%50.0
0.02PPP (1.4)07/17-07/1841% : 46%47.1
0.01PPP (1.4)07/11-07/1246% : 48%48.9
Sum 2.8TotalAvg 50.0
North Carolina
WeightPollster (rating)DatesHarris: TrumpHarris Share
0.91From Natl. Avg. (1.08β‹…x + -7.29)47.0
0.25Morning Consult (1.8)07/24-07/2846% : 48%48.9
0.15Redfield & Wilton Strategies (1.8)07/22-07/2443% : 46%48.3
0.02PPP (1.4)07/19-07/2044% : 48%47.8
Sum 1.3TotalAvg 47.5
Arizona
WeightPollster (rating)DatesHarris: TrumpHarris Share
0.94From Natl. Avg. (0.99β‹…x + -1.85)48.0
0.45Emerson (2.9)07/22-07/2347% : 53%47.4
0.26Morning Consult (1.8)07/24-07/2849% : 47%51.0
0.19PPP (1.4)07/29-07/3047% : 49%49.0
0.17Public Opinion Strategies (1.7)07/23-07/2943% : 48%47.3
0.15Redfield & Wilton Strategies (1.8)07/22-07/2443% : 46%48.3
0.03InsiderAdvantage (2.0)07/15-07/1642% : 48%46.7
0.02PPP (1.4)07/19-07/2040% : 46%46.5
0.01PPP (1.4)07/10-07/1144% : 52%45.8
Sum 2.2TotalAvg 48.2
Nevada
WeightPollster (rating)DatesHarris: TrumpHarris Share
0.94From Natl. Avg. (1.52β‹…x + -28.71)47.8
0.22Morning Consult (1.8)07/24-07/2847% : 45%51.1
0.17Public Opinion Strategies (1.7)07/23-07/2945% : 46%49.5
0.14Redfield & Wilton Strategies (1.8)07/22-07/2443% : 45%48.9
0.03InsiderAdvantage (2.0)07/15-07/1640% : 50%44.4
Sum 1.5TotalAvg 48.5
Florida
WeightPollster (rating)DatesHarris: TrumpHarris Share
0.93From Natl. Avg. (1.73β‹…x + -41.47)45.4
0.50U. North Florida (2.8)07/24-07/2742% : 49%46.2
0.15Redfield & Wilton Strategies (1.8)07/22-07/2439% : 47%45.3
0.03InsiderAdvantage (2.0)07/15-07/1639% : 49%44.2
Sum 1.6TotalAvg 45.6

Table Source

Estimating Poll Miss

Morris (2024) at FiveThirtyEight reports that the polling average typically misses the actual swing state result by about ~2 points for a given candidate (or ~3.8 points for the margin). This is pretty remarkable. Even combining dozens of pollsters each asking thousands of people their vote right before the election, we still expect to be several points off. Elections are hard to predict.

Our current situation is even more uncertain. The ~2 points of miss is for a typical candidate on election day. With Harris we start in a time with potentially limited polling data. To estimate this we look at the weighted count of polls.

Right now we estimate we have the equivalent of 11.0 top-quality national polls for Harris. For comparison, we estimate we had 21.5 top-quality national polls for Biden the day before he dropped out and 58.4 top-quality polls for Biden on 2020 Election Day.

For swing state polls we apply the same weighting. To fill in gaps in swing state polling, we also combine with national polling. Each state has a different relationship to national polls. We fit a linear function (ie, a slope and an intercept) going from our custom national polling average to 538's state polling average for Biden in 2020 and 2024. We average this mapped value with available polls (its weight is somewhat arbitrarily defined as the  R2 of the linear fit). We highlight that the national polling-average was highly predictive of 538's swing state polling-averages (avg R2=0.91).

In Figure 1 we show both the weighted count of polls and the square root of the weighted count. Probability theory tells us that sampling is not linear. As an example, say we had a poll of 1000 people estimating a vote of 45% with some amount of error. If we repeated that poll and now had 2000 people, we would not be twice as confident of our estimate. Instead, we would need roughly 4000 people to be twice as confident. Having 11.0 polls now compared to 58.4 polls on election day 2020 is like having 11.0/58.40.4 as much polling information.

We make the assumption that the amount of swing state polls we had on election day 2020 was enough for the typical ~2 point average miss. We then estimate expected error in each swing state given how many polls we currently have there. We assume that only half of the average miss (ie, ~1 point) could be reduced with more polling. The other half this is half fraction is purely heuristic. More rigorous work could try to estimate this empirically of the miss is some unrecoverable error (for example due inability to contact subsets of voters, or industry-wide methodology flaws). See codefor precise details.

Using this method, we estimate that the average swing state polling miss is currently 3.5 points.

We emphasize the average miss here is just an average across thousands of simulations. The actual miss can be higher or lower in either direction. Following Morris (2024), we model the distribution of these errors as a t-distribution with five degrees of freedom.

Additionally, we assume poll misses are correlatedbetween states. Similar to our previous Whitmer model, we correlations use from the 2020 FiveThirtyEight and Economist models (via Pearce (2020)). A more pure version of this model would try to reestimate this from data.

Estimates with Poll Miss

If we use this expected miss and pretend the election was today, we would estimate a 45% chance Harris would win.

Where will Polls be in 93 Days

If a candidate is behind, they would hope for more variance in outcomes. If they are ahead, less variance is better. In addition to typical poll misses, we would expect some variance from movement polls in the next 93 days to Election Day.

Here we show the trends in 2020 and 2024. Dem Share is share of vote with just the Democrats and Republicans, ie (Dem/(Dem+Rep))·100. Keep in mind, the magnitude of shifts in this value is half that of shifts in margin.

The average 93-day movement in 2020 was 1.04 points and the average 93-day movement in Biden 2024 was 0.59 points (movements are absolute value. It could be up or down.). The largest movement observed is 3.87 points in Michigan between March 15, 2020 and June 16, 2020. During this time Covid went from an abstract concept to most Americans, to over 600,000 Americans dead while Trump mused about injecting people with disinfectants and slowing testing to make himself look better.

We attempt to estimate the mean expected move for Harris, just based on limited polling so far. We do this via a rough process of random walks sampled from her movement so far (src). We then have an estimate of an average expected move of 10.6 using just Harris 2024 data.

To get a final estimate of Harris's expected move in each state, we average together Biden 2020, Biden 2024, and Harris 2024 and blend data for a given state and the national moves. Please refer to the codefor a precise definition. This process estimates an average expected move across the 8 swing states is 4.16 points using all data.

Using this expected average, we modelthe distribution of movements as a t-distribution with 5 degrees of freedom.

Results with Movement

Here show estimated win probability taking into account both the average 3.5 point polling miss and the expected 4.16 point average poll movement.

Thus, allowing variance from movement slightly changes odds for Harris. One might intuitively expect a larger change, however we must remember that under this model, the polling miss and poll movement is assumed to be independent. As an example, we could sample a 2 point move up in polls, but this could be cancelled out by sampling a -3 point poll miss. The average combined poll miss is 5.6 points.

Hypothetical VP shifts

There has been a lot of speculation of Harris's potential VP and its impact on the race. Here we briefly explore this to get a sense of how much potential VP movement matters. Leading candidates currently include talks-like-Obama Governor Josh Shapiro from Pennsylvania, fighter-pilot-astronaut-senator Mark Kelly from Arizona, and weird-calling Tim Walz from Minnesota.

Each has their own appeals. One that has drawn attention is their affect on swing states. The simple way of doing this is to just look at a "home state advantage" for the senator. Historically this hasn't mattered much (Devine and Kopko (2020), Silver (2024)), however maybe one believes this election is different.

Here we apply a boost to Harris's vote share in Pennsylvania. The boost follows a normal distribution with a given mean and standard deviation. Note this is a shift in vote share (so half a boost in margin). We start with Election Today results under our assumed current poll miss of 3.5 avg absolute share-points, not the results with poll movement (as this presumably already includes some of the variance of events like a VP pick). We show national election-today win probability (πŸ‡ΊπŸ‡Έ) and election-today win probability in the Keystone State (πŸͺ¨).

Mean Adjustment β†’
Οƒ ↓
00.250.51.01.55.0
045%πŸ‡ΊπŸ‡Έ
45%πŸͺ¨
46%πŸ‡ΊπŸ‡Έ
48%πŸͺ¨
47%πŸ‡ΊπŸ‡Έ
51%πŸͺ¨
49%πŸ‡ΊπŸ‡Έ
57%πŸͺ¨
50%πŸ‡ΊπŸ‡Έ
63%πŸͺ¨
54%πŸ‡ΊπŸ‡Έ
90%πŸͺ¨
1.044%πŸ‡ΊπŸ‡Έ
45%πŸͺ¨
45%πŸ‡ΊπŸ‡Έ
48%πŸͺ¨
46%πŸ‡ΊπŸ‡Έ
51%πŸͺ¨
48%πŸ‡ΊπŸ‡Έ
57%πŸͺ¨
50%πŸ‡ΊπŸ‡Έ
62%πŸͺ¨
54%πŸ‡ΊπŸ‡Έ
89%πŸͺ¨
2.044%πŸ‡ΊπŸ‡Έ
45%πŸͺ¨
45%πŸ‡ΊπŸ‡Έ
48%πŸͺ¨
46%πŸ‡ΊπŸ‡Έ
51%πŸͺ¨
47%πŸ‡ΊπŸ‡Έ
56%πŸͺ¨
48%πŸ‡ΊπŸ‡Έ
61%πŸͺ¨
53%πŸ‡ΊπŸ‡Έ
87%πŸͺ¨
4.044%πŸ‡ΊπŸ‡Έ
47%πŸͺ¨
44%πŸ‡ΊπŸ‡Έ
49%πŸͺ¨
44%πŸ‡ΊπŸ‡Έ
51%πŸͺ¨
45%πŸ‡ΊπŸ‡Έ
55%πŸͺ¨
46%πŸ‡ΊπŸ‡Έ
58%πŸͺ¨
51%πŸ‡ΊπŸ‡Έ
80%πŸͺ¨

Here we show a similar table for Arizona (🌡), which is a less critical state.

Mean Adjustment β†’
Οƒ ↓
00.250.51.01.55.0
045%πŸ‡ΊπŸ‡Έ
33%🌡
45%πŸ‡ΊπŸ‡Έ
35%🌡
45%πŸ‡ΊπŸ‡Έ
37%🌡
45%πŸ‡ΊπŸ‡Έ
42%🌡
46%πŸ‡ΊπŸ‡Έ
47%🌡
47%πŸ‡ΊπŸ‡Έ
79%🌡
1.045%πŸ‡ΊπŸ‡Έ
33%🌡
45%πŸ‡ΊπŸ‡Έ
35%🌡
45%πŸ‡ΊπŸ‡Έ
38%🌡
45%πŸ‡ΊπŸ‡Έ
42%🌡
46%πŸ‡ΊπŸ‡Έ
47%🌡
47%πŸ‡ΊπŸ‡Έ
78%🌡
2.045%πŸ‡ΊπŸ‡Έ
34%🌡
45%πŸ‡ΊπŸ‡Έ
36%🌡
45%πŸ‡ΊπŸ‡Έ
39%🌡
45%πŸ‡ΊπŸ‡Έ
44%🌡
46%πŸ‡ΊπŸ‡Έ
47%🌡
47%πŸ‡ΊπŸ‡Έ
76%🌡
4.045%πŸ‡ΊπŸ‡Έ
38%🌡
45%πŸ‡ΊπŸ‡Έ
40%🌡
45%πŸ‡ΊπŸ‡Έ
41%🌡
45%πŸ‡ΊπŸ‡Έ
45%🌡
46%πŸ‡ΊπŸ‡Έ
48%🌡
47%πŸ‡ΊπŸ‡Έ
71%🌡

Reading along the top row with (Standard Deviation = 0) is just a fixed boost in that state. We see improvement over today with increasing boost. The table also helps illustrate how while assuming a massive boost and "locking in" a state like PA does definitely improve odds, its effect is limited when other states like MI and WI are still toss-ups.

One might guess that in the case like Josh Shapiro, if he's liked in PA he might also be liked in states who's voting patterns is historically correlated with PA. However, given that the VP home state advantage is already small, this effect seems even smaller, and trying to put a number on it seems perilous.

We can make wild guesses. Assume there is a mean 1 point shift in PA with a standard deviation of 1 point. Also assume other states have a covariate boost as well using a cubed correlation to PA (again using correlations via Pearce (2020)) and β…“ the magnitude of shift (Apologies for rough explanation. For more details on this part I'll defer to the code). Then we end up with 51.2%πŸ‡ΊπŸ‡Έ for Shapiro vs the non-collated 𝒩(1,1) boost result of 48.1%πŸ‡ΊπŸ‡Έ. But these are wild guesses. VP effects are so poorly understood that we don't have a basis to choose particular correlation values.

Given no good understanding of VP correlations, our best guess is probably to assume a small state shift with a high standard deviation on the effect. Still there is potential for a slight boost for Harris's position today.

With these poorly understood factors, Harris likely should focus the choice less on home states, and more on who would actually be the best US Vice President, and what internal polling is showing (which was presumably conducted).

Model Limitations

There are several limitations of this model

Polling is a limited tool: As mentioned earlier, even with data from dozens of pollsters right before Election Day, polls typically miss by several points. One thing I took away from building this model is an encouragement to just care about polls less. We are unlikely to end up in a world on election day where polls tell us much different than a coinflip.

No mean-reversion or trends: This model assumes misses in either direction equally probable. However, in reality we should expect moves far from the historical mean to be less likely (as these voters become increasingly partisan).

Poll uncertainty quantification not empirically validated: We make assumptions about the behavior of poll uncertainty for a given number of polls that might not be valid (in particular the fraction of aleatoric uncertainty is unclear). As mentioned, stronger work would better estimate this using data from past races.

Not all states: We only model 8 swing states. This is because in situations where a state like Texas or Iowa goes blue, the election is almost certainly already decided elsewhere. We also don't model the atypical way Nebraska and Maine distribute their electors.

Ignoring 3rd party votes: Better factoring in RFK Jr. might slightly change things. Also, undecided voters are essentially assumed to split equally rather than more complex schemes.

Correlations simple: This model is not completely pure, as it uses state correlations extracted from other models. Additionally, we only consider correlations at a state-level correlations, rather than more complex schemes.

Many assumptions of this model are rough and not particularly principled.

Conclusion

The race is highly uncertain. Considering both the "Election Today" results with poll misses and the poll movement estimates, the race is close to a coin flip. Making confident predictions either way is unwise.

As a reference, using polling data from July 21 this model would have estimated that Biden had a 27% chance of winning when he dropped out with 107 days to the election. Thus, with Harris estimated at 49% today, we'd estimate she so far has improved odds over Biden then. There are potentially exciting times ahead.