Why We Are Terrible at Estimating
Humans are systematically bad at estimating how long tasks will take. This is not a personal failing. It is a well-documented cognitive bias called the planning fallacy, first described by psychologists Daniel Kahneman and Amos Tversky. The planning fallacy states that people consistently underestimate the time, cost, and risk of future actions while overestimating the benefits.
The evidence is everywhere. Software projects routinely run over schedule. Home renovations take twice as long as quoted. Even simple errands take longer than expected. Studies show that people underestimate task duration by 25 to 50 percent on average, even for tasks they have done before.
The good news is that estimation accuracy is a skill that can be improved. With the right techniques and a commitment to tracking your estimates against reality, you can dramatically improve your ability to plan realistic schedules.
The Psychology of Estimation Bias
Optimism Bias
When estimating future tasks, we imagine the best-case scenario. We picture ourselves working without interruption, encountering no obstacles, and making no mistakes. This sunny vision rarely matches reality, but it consistently shapes our estimates.
Anchoring to Best Case
Once we have imagined the best-case scenario, our estimate anchors to it. Even when we try to add buffer time, we adjust insufficiently from this anchor. A task we imagine taking two hours might get estimated at two and a half hours, when the realistic average is four hours.
Neglecting Overhead
Estimates typically account for the core work but not the surrounding activities. Setting up, context switching, clarifying requirements, getting feedback, making revisions, and wrapping up all take time that rarely appears in estimates.
The Uniqueness Bias
We tend to view each task as unique, even when it is similar to tasks we have done before. This prevents us from using past experience as a guide. We think "this project is different" even when the fundamental characteristics are the same.
Social Pressure
In professional settings, there is often implicit or explicit pressure to provide optimistic estimates. Saying "this will take three weeks" is less welcome than "this will take one week," even when three weeks is more accurate. This pressure distorts estimates downward.
Technique 1: Reference Class Forecasting
What It Is
Reference class forecasting, developed by Bent Flyvbjerg, counters the planning fallacy by using data from similar past tasks rather than relying on your intuition about the current task.
Instead of asking "how long will this take?" you ask "how long did similar tasks take in the past?"
How to Apply It
- Identify the reference class: What category of task is this? A blog post, a bug fix, a client presentation, a feature implementation?
- Gather data: How long have similar tasks taken historically? Use your task management history, time tracking data, or calendar records.
- Use the distribution: Rather than picking one number, look at the range. What was the fastest similar task? The slowest? The median?
- Estimate from the outside: Base your estimate on the historical data rather than your inside view of this specific task.
Example
You need to write a technical specification. Instead of thinking about this specific spec, you look at the last five specs you wrote:
| Spec | Estimated Time | Actual Time | |---|---|---| | API redesign | 4 hours | 7 hours | | Payment integration | 3 hours | 5 hours | | User permissions | 4 hours | 4.5 hours | | Search feature | 2 hours | 6 hours | | Dashboard analytics | 3 hours | 4 hours |
The median actual time is 5 hours. The range is 4 to 7 hours. A reasonable estimate for the new spec is 5 hours, with a range of 4 to 7 hours. This is far more accurate than the 3 hours you might have estimated from your inside view.
Technique 2: The Multiplier Method
What It Is
The multiplier method is a simpler version of reference class forecasting. You make your best intuitive estimate, then multiply it by a factor that accounts for your historical accuracy.
Finding Your Multiplier
Track your estimates against actuals for 20 to 30 tasks. Divide actual time by estimated time for each one to get the ratio. Your multiplier is the average of these ratios.
Common multipliers by domain:
| Domain | Typical Multiplier | |---|---| | Software development | 2.0 - 3.0x | | Writing and content | 1.5 - 2.0x | | Administrative tasks | 1.2 - 1.5x | | Creative work | 2.0 - 4.0x | | Meetings and calls | 1.1 - 1.3x | | Research | 2.0 - 3.0x |
How to Apply It
- Make your intuitive estimate
- Multiply by your personal multiplier
- That is your working estimate
Example: you estimate a task will take 2 hours and your historical multiplier is 1.8x. Your working estimate is 3.6 hours, round to 4.
The multiplier method is less precise than reference class forecasting but much easier to apply. It requires minimal data and can be implemented immediately.
Technique 3: Three-Point Estimation
What It Is
Three-point estimation asks for three scenarios:
- Optimistic (O): If everything goes perfectly, how long would this take?
- Most Likely (M): Given normal conditions, how long would this take?
- Pessimistic (P): If things go wrong, how long could this take?
The PERT Formula
The Program Evaluation and Review Technique combines these into a weighted average:
Expected Time = (O + 4M + P) / 6
Example:
- Optimistic: 2 hours
- Most Likely: 4 hours
- Pessimistic: 10 hours
- Expected: (2 + 16 + 10) / 6 = 4.7 hours
Why Three Points Work
Three-point estimation forces you to consider scenarios beyond the best case. The pessimistic estimate, in particular, is valuable because it surfaces risks and complications that single-point estimates ignore.
It also produces a range rather than a single number, which is inherently more honest. Saying "this will take 3 to 7 hours" is more useful than saying "this will take 4 hours" because it communicates uncertainty.
Technique 4: Task Decomposition
What It Is
Large tasks are harder to estimate than small ones. Break a task into subtasks, estimate each subtask individually, and sum the estimates. The resulting total is typically more accurate than a single estimate for the whole.
Why It Works
Decomposition forces you to think through the actual steps involved. It surfaces hidden work that a high-level estimate would miss. It also reduces the scope of each individual estimate, which reduces error.
How to Apply It
- Break the task into subtasks of 30 minutes to 2 hours each
- Estimate each subtask independently
- Add 10 to 20 percent buffer for transitions and overhead between subtasks
- Sum the estimates plus buffer
Example: "Build user dashboard" is hard to estimate. But decomposing it reveals the actual work:
- Design layout mockup: 2 hours
- Set up data queries: 1.5 hours
- Build chart components: 3 hours
- Implement filters: 2 hours
- Write tests: 1.5 hours
- Code review and revisions: 1 hour
- Total: 11 hours
- Plus 15% buffer: 12.7 hours, round to 13
Compare this to the "about a day" estimate you might have given without decomposition.
Tracking Estimates vs. Actuals
Why Tracking Matters
You cannot improve what you do not measure. Tracking your estimates against actual completion times is the single most effective way to improve estimation accuracy over time.
What to Track
For each task, record:
- Estimated duration at the time of creation
- Actual duration when the task is complete
- Ratio (actual / estimated)
- Category (type of task)
- Complexity (simple, moderate, complex)
Using Your Data
After tracking 30 or more tasks, you will have enough data to identify patterns:
- Which types of tasks do you estimate well? Which do you consistently underestimate?
- Do you estimate better for simple tasks than complex ones?
- Has your accuracy improved over time?
- What is your average ratio by category?
This data feeds directly into the multiplier method and reference class forecasting, creating a virtuous cycle of improving accuracy.
When you plan your day using SettlTM's daily capacity planning, estimated durations determine how many tasks fit into your available hours. Accurate estimates mean realistic daily plans that you can actually complete, rather than aspirational lists that breed frustration. Start planning with accurate capacity to see how estimated durations inform your daily schedule. The Pomodoro timer can also help you track actual working time against your estimates.
Common Estimation Mistakes
Confusing Effort with Duration
Effort is how many hours of work a task requires. Duration is the calendar time from start to finish. A task might require 4 hours of effort but take 3 days of duration because you have meetings, other priorities, and waiting time in between.
Always clarify which one you are estimating. For daily planning, effort is what matters. For project scheduling, duration is what matters.
Ignoring Dependencies and Wait Time
Tasks that depend on other people always take longer than tasks you can do independently. Waiting for feedback, waiting for access, waiting for decisions. These waits do not appear in effort estimates but dramatically impact duration.
Estimating Under Pressure
Never estimate on the spot in a meeting. The social pressure to give a low number is too strong. Ask for time to think about it, check your historical data, and provide a considered estimate later.
Round Number Bias
Humans gravitate toward round numbers: 1 hour, 2 hours, half a day, a week. Real tasks do not come in round-number sizes. If your estimate comes out to 1 hour and 40 minutes, leave it at that. Rounding down to 1 hour creates systematic error.
Failing to Account for Quality Standards
An estimate for "write a blog post" depends entirely on the expected quality level. A quick draft takes 2 hours. A polished, well-researched article takes 8. Make sure your estimate reflects the actual quality standard, not just the core deliverable.
Estimation in Team Contexts
Planning Poker
In agile teams, planning poker uses group estimation to counteract individual biases. Team members independently estimate a task, then reveal their estimates simultaneously. Large discrepancies trigger discussion that often surfaces important information about complexity or risk.
Velocity-Based Estimation
Instead of estimating in hours, agile teams often estimate in relative points and use historical velocity (points completed per sprint) to predict timelines. This approach sidesteps the planning fallacy by relying on demonstrated throughput rather than optimistic forecasts.
Estimation Agreements
Teams should agree on what their estimates include. Does an estimate for a feature include testing? Documentation? Code review? Design? Without agreement, two people estimating the same task might be estimating different scopes of work.
Building an Estimation Practice
Improving your estimation is a long-term practice, not a one-time technique. Here is a practical routine:
- Every task: Record your estimate when you create the task
- After completion: Record the actual time and the ratio
- Weekly review: Scan your ratios for the week. Notice patterns.
- Monthly analysis: Calculate your average multiplier by task category. Update your estimation approach.
- Quarterly calibration: Review whether your accuracy is improving. If not, try a different technique.
Building an Estimation Culture on Teams
Estimation as a Team Practice
When teams estimate together, individual biases partially cancel out. One person's optimism may be balanced by another's conservatism. The conversation itself surfaces important information about task complexity that solo estimation misses.
Calibration Sessions
Periodically, teams should review their estimation accuracy as a group. Pull up completed tasks with their estimates and actuals. Calculate the team's collective multiplier. Discuss which types of tasks are consistently underestimated and why.
These calibration sessions serve double duty: they improve future estimates and they build shared understanding of task complexity. When everyone sees that database migrations consistently take 2.5 times the estimate, the whole team adjusts their expectations going forward.
Estimation Norms
Teams benefit from explicit norms about estimation:
- Estimates include testing and review time unless explicitly stated otherwise
- Estimates assume normal conditions, not best case or worst case
- Uncertainty is communicated as a range, not a single number
- Estimates are updated when new information emerges, without stigma
- Actual times are tracked without blame or judgment
These norms create psychological safety around estimation, which improves accuracy because people are not pressured to provide unrealistically low numbers.
The Planning Horizon Effect
Estimation accuracy degrades as the time horizon increases. You can estimate today's tasks within 20 percent. Next week's tasks within 40 percent. Next month's tasks within 100 percent. Six months out, estimates are essentially guesses.
This has practical implications for how you plan: create detailed estimates only for the near term. Use progressively rougher estimates as you look further out. Do not waste time creating precise estimates for work that is months away, because the work itself will likely change before you get to it. This principle supports the sprint-based approach where you estimate in detail only for the current sprint and use rough sizing for everything in the backlog.
Evidence-Based Scheduling
Some project management tools and methodologies track estimation accuracy automatically and use historical data to adjust future projections. The tool learns your personal multiplier over time and applies it transparently. This removes the human effort from calibration while continuously improving accuracy. Even without specialized tools, a simple spreadsheet tracking estimates versus actuals by task category provides the same calibration data with minimal overhead.
Key Takeaways
- The planning fallacy causes systematic underestimation. Awareness alone does not fix it. You need structured techniques.
- Reference class forecasting uses historical data from similar tasks to ground your estimates in reality rather than optimism.
- The multiplier method is the simplest improvement: estimate intuitively, then multiply by your historical accuracy ratio.
- Task decomposition improves accuracy by forcing you to think through actual steps and surface hidden work.
- Tracking estimates vs. actuals is the highest-leverage practice. You cannot improve what you do not measure.
Frequently Asked Questions
How do I estimate tasks I have never done before?
Use three-point estimation and lean heavily on the pessimistic case. For truly novel tasks, your first estimate should be treated as a rough guess. Update it as you learn more about the actual work involved.
Should I add buffer time to every estimate?
Rather than adding arbitrary buffers, use the multiplier method. This provides a principled adjustment based on your actual historical accuracy rather than a random padding amount.
How long does it take to get good at estimating?
Most people see meaningful improvement after tracking 30 to 50 tasks. Significant accuracy gains typically appear within three to six months of consistent practice.
What is a good estimation accuracy target?
Aim to have your actuals within 20 percent of your estimates for at least 70 percent of tasks. Perfect accuracy is neither achievable nor necessary. The goal is to be consistently close enough for reliable planning.
How do I handle tasks that get interrupted or paused?
Track actual working time separately from calendar time. If a 4-hour task gets spread across three days, the effort estimate was 4 hours regardless of the duration. Use this distinction when calibrating your multiplier.
