r = 0.0614
We calculated the Pearson correlation coefficient between production budget and IMDb rating across every film in our database with both data points โ 11,783 films. The result: r = 0.0614.
In statistics, a Pearson r of 0.06 means the relationship between two variables is essentially nonexistent. For context: r = 0 means zero correlation. r = 0.1 is considered negligible. r = 0.3 is weak. Our measured r = 0.06 explains less than 0.4% of the variance in film quality. Budget accounts for virtually none of what makes a film good or bad.
This is one of the most robust findings in our entire dataset, and it contradicts one of the most deeply held assumptions in the film industry: that more money means better movies.
The Expensive Disasters
The data is merciless on high-budget failures. Films that spent $100 million or more yet achieved some of the lowest ratings in our database:
- Saving Bikini Bottom: The Sandy Cheeks Movie (2024) โ $100M budget, IMDb 3.6
- The Cat in the Hat (2003) โ $109M budget, IMDb 4.2
- Space Jam: A New Legacy (2021) โ $150M budget, IMDb 4.5, RT 26%
- Jupiter Ascending (2015) โ $176M budget, IMDb 5.3
- Green Lantern (2011) โ $200M budget, IMDb 5.5
These aren't rare outliers โ they're systematic. Among films with budgets above $150 million, the average IMDb rating is only 6.5, barely above the database mean of 6.56. Spending twice the median budget buys you essentially zero quality improvement.
The Cheap Masterpieces
Meanwhile, at the other end of the budget spectrum:
- Paranormal Activity (2007) โ $15K budget, IMDb 6.3, gross $193M
- The Blair Witch Project (1999) โ $60K budget, IMDb 6.5, gross $249M
- Primer (2004) โ $7K budget, IMDb 6.8
- Once (2007) โ $150K budget, IMDb 7.9, Oscar winner
- Moonlight (2016) โ $1.5M budget, IMDb 7.4, Best Picture winner
The pattern is clear: quality is not a function of investment. Some of the most critically acclaimed and commercially successful films in history were made for less than the catering budget of a single Marvel set.
Why the Correlation Is (Slightly) Positive
The r of 0.06 is technically positive, not zero. Why? Because high budgets do buy one thing: production value โ better visual effects, bigger stars, wider distribution. These factors nudge ratings very slightly upward for action and sci-fi films where spectacle matters. But the effect is so small that it's drowned out by the far more important factors: script quality, direction, and storytelling โ none of which correlate with spending.
The Formula Implication
Our Master Score formula does not include budget as a variable, and this analysis explains why. If budget predicted quality, it would be a useful feature. At r = 0.06, including budget would add noise, not signal. The features that do predict quality in our screenplay analysis โ dialogue ratio, scene count, vocabulary richness, sentiment patterns โ are all attributes of the writing, not the spending.
The data's message is unambiguous: if you want to predict whether a film will be good, look at the script. Don't look at the budget. The correlation is 0.06 โ and that number is the death certificate for the assumption that money makes movies better.
