Why We Are Terrible at Estimating

Humans are systematically bad at estimating how long tasks will take. This is not a personal failing. It is a well-documented cognitive bias called the planning fallacy, first described by psychologists Daniel Kahneman and Amos Tversky. The planning fallacy states that people consistently underestimate the time, cost, and risk of future actions while overestimating the benefits.

The evidence is everywhere. Software projects routinely run over schedule. Home renovations take twice as long as quoted. Even simple errands take longer than expected. Studies show that people underestimate task duration by 25 to 50 percent on average, even for tasks they have done before.

The good news is that estimation accuracy is a skill that can be improved. With the right techniques and a commitment to tracking your estimates against reality, you can dramatically improve your ability to plan realistic schedules.

The Psychology of Estimation Bias

Optimism Bias

When estimating future tasks, we imagine the best-case scenario. We picture ourselves working without interruption, encountering no obstacles, and making no mistakes. This sunny vision rarely matches reality, but it consistently shapes our estimates.

Anchoring to Best Case

Once we have imagined the best-case scenario, our estimate anchors to it. Even when we try to add buffer time, we adjust insufficiently from this anchor. A task we imagine taking two hours might get estimated at two and a half hours, when the realistic average is four hours.

Neglecting Overhead

Estimates typically account for the core work but not the surrounding activities. Setting up, context switching, clarifying requirements, getting feedback, making revisions, and wrapping up all take time that rarely appears in estimates.

The Uniqueness Bias

We tend to view each task as unique, even when it is similar to tasks we have done before. This prevents us from using past experience as a guide. We think "this project is different" even when the fundamental characteristics are the same.

Social Pressure

In professional settings, there is often implicit or explicit pressure to provide optimistic estimates. Saying "this will take three weeks" is less welcome than "this will take one week," even when three weeks is more accurate. This pressure distorts estimates downward.

Technique 1: Reference Class Forecasting

What It Is

Reference class forecasting, developed by Bent Flyvbjerg, counters the planning fallacy by using data from similar past tasks rather than relying on your intuition about the current task.

Instead of asking "how long will this take?" you ask "how long did similar tasks take in the past?"

How to Apply It

Identify the reference class: What category of task is this? A blog post, a bug fix, a client presentation, a feature implementation?
Gather data: How long have similar tasks taken historically? Use your task management history, time tracking data, or calendar records.
Use the distribution: Rather than picking one number, look at the range. What was the fastest similar task? The slowest? The median?
Estimate from the outside: Base your estimate on the historical data rather than your inside view of this specific task.

Example

You need to write a technical specification. Instead of thinking about this specific spec, you look at the last five specs you wrote:

| Spec | Estimated Time | Actual Time | |---|---|---| | API redesign | 4 hours | 7 hours | | Payment integration | 3 hours | 5 hours | | User permissions | 4 hours | 4.5 hours | | Search feature | 2 hours | 6 hours | | Dashboard analytics | 3 hours | 4 hours |

The median actual time is 5 hours. The range is 4 to 7 hours. A reasonable estimate for the new spec is 5 hours, with a range of 4 to 7 hours. This is far more accurate than the 3 hours you might have estimated from your inside view.

Technique 2: The Multiplier Method

What It Is

The multiplier method is a simpler version of reference class forecasting. You make your best intuitive estimate, then multiply it by a factor that accounts for your historical accuracy.

Finding Your Multiplier

Track your estimates against actuals for 20 to 30 tasks. Divide actual time by estimated time for each one to get the ratio. Your multiplier is the average of these ratios.

Common multipliers by domain:

| Domain | Typical Multiplier | |---|---| | Software development | 2.0 - 3.0x | | Writing and content | 1.5 - 2.0x | | Administrative tasks | 1.2 - 1.5x | | Creative work | 2.0 - 4.0x | | Meetings and calls | 1.1 - 1.3x | | Research | 2.0 - 3.0x |

How to Apply It

Make your intuitive estimate
Multiply by your personal multiplier
That is your working estimate

Example: you estimate a task will take 2 hours and your historical multiplier is 1.8x. Your working estimate is 3.6 hours, round to 4.

The multiplier method is less precise than reference class forecasting but much easier to apply. It requires minimal data and can be implemented immediately.

Technique 3: Three-Point Estimation

What It Is

Three-point estimation asks for three scenarios:

Optimistic (O): If everything goes perfectly, how long would this take?
Most Likely (M): Given normal conditions, how long would this take?
Pessimistic (P): If things go wrong, how long could this take?

The PERT Formula

The Program Evaluation and Review Technique combines these into a weighted average:

Expected Time = (O + 4M + P) / 6

Example:

Optimistic: 2 hours
Most Likely: 4 hours
Pessimistic: 10 hours
Expected: (2 + 16 + 10) / 6 = 4.7 hours

Why Three Points Work

Three-point estimation forces you to consider scenarios beyond the best case. The pessimistic estimate, in particular, is valuable because it surfaces risks and complications that single-point estimates ignore.

It also produces a range rather than a single number, which is inherently more honest. Saying "this will take 3 to 7 hours" is more useful than saying "this will take 4 hours" because it communicates uncertainty.

Technique 4: Task Decomposition

What It Is

Large tasks are harder to estimate than small ones. Break a task into subtasks, estimate each subtask individually, and sum the estimates. The resulting total is typically more accurate than a single estimate for the whole.

Why It Works

Decomposition forces you to think through the actual steps involved. It surfaces hidden work that a high-level estimate would miss. It also reduces the scope of each individual estimate, which reduces error.

How to Apply It

Break the task into subtasks of 30 minutes to 2 hours each
Estimate each subtask independently
Add 10 to 20 percent buffer for transitions and overhead between subtasks
Sum the estimates plus buffer

Example: "Build user dashboard" is hard to estimate. But decomposing it reveals the actual work:

Design layout mockup: 2 hours
Set up data queries: 1.5 hours
Build chart components: 3 hours
Implement filters: 2 hours
Write tests: 1.5 hours
Code review and revisions: 1 hour
Total: 11 hours
Plus 15% buffer: 12.7 hours, round to 13

Compare this to the "about a day" estimate you might have given without decomposition.

Tracking Estimates vs. Actuals

Why Tracking Matters

You cannot improve what you do not measure. Tracking your estimates against actual completion times is the single most effective way to improve estimation accuracy over time.

What to Track

For each task, record:

Estimated duration at the time of creation
Actual duration when the task is complete
Ratio (actual / estimated)
Category (type of task)
Complexity (simple, moderate, complex)

Using Your Data

After tracking 30 or more tasks, you will have enough data to identify patterns:

Which types of tasks do you estimate well? Which do you consistently underestimate?
Do you estimate better for simple tasks than complex ones?
Has your accuracy improved over time?
What is your average ratio by category?

This data feeds directly into the multiplier method and reference class forecasting, creating a virtuous cycle of improving accuracy.

When you plan your day using SettlTM's daily capacity planning, estimated durations determine how many tasks fit into your available hours. Accurate estimates mean realistic daily plans that you can actually complete, rather than aspirational lists that breed frustration. Start planning with accurate capacity to see how estimated durations inform your daily schedule. The Pomodoro timer can also help you track actual working time against your estimates.

Common Estimation Mistakes

Confusing Effort with Duration

Effort is how many hours of work a task requires. Duration is the calendar time from start to finish. A task might require 4 hours of effort but take 3 days of duration because you have meetings, other priorities, and waiting time in between.

Always clarify which one you are estimating. For daily planning, effort is what matters. For project scheduling, duration is what matters.

Ignoring Dependencies and Wait Time

Tasks that depend on other people always take longer than tasks you can do independently. Waiting for feedback, waiting for access, waiting for decisions. These waits do not appear in effort estimates but dramatically impact duration.

Estimating Under Pressure

Never estimate on the spot in a meeting. The social pressure to give a low number is too strong. Ask for time to think about it, check your historical data, and provide a considered estimate later.

Round Number Bias

Humans gravitate toward round numbers: 1 hour, 2 hours, half a day, a week. Real tasks do not come in round-number sizes. If your estimate comes out to 1 hour and 40 minutes, leave it at that. Rounding down to 1 hour creates systematic error.

Failing to Account for Quality Standards

An estimate for "write a blog post" depends entirely on the expected quality level. A quick draft takes 2 hours. A polished, well-researched article takes 8. Make sure your estimate reflects the actual quality standard, not just the core deliverable.

Estimation in Team Contexts

Planning Poker

In agile teams, planning poker uses group estimation to counteract individual biases. Team members independently estimate a task, then reveal their estimates simultaneously. Large discrepancies trigger discussion that often surfaces important information about complexity or risk.

Velocity-Based Estimation

Instead of estimating in hours, agile teams often estimate in relative points and use historical velocity (points completed per sprint) to predict timelines. This approach sidesteps the planning fallacy by relying on demonstrated throughput rather than optimistic forecasts.

Estimation Agreements

Teams should agree on what their estimates include. Does an estimate for a feature include testing? Documentation? Code review? Design? Without agreement, two people estimating the same task might be estimating different scopes of work.

Building an Estimation Practice

Improving your estimation is a long-term practice, not a one-time technique. Here is a practical routine:

Every task: Record your estimate when you create the task
After completion: Record the actual time and the ratio
Weekly review: Scan your ratios for the week. Notice patterns.
Monthly analysis: Calculate your average multiplier by task category. Update your estimation approach.
Quarterly calibration: Review whether your accuracy is improving. If not, try a different technique.

Building an Estimation Culture on Teams

Estimation as a Team Practice

When teams estimate together, individual biases partially cancel out. One person's optimism may be balanced by another's conservatism. The conversation itself surfaces important information about task complexity that solo estimation misses.

Calibration Sessions

Periodically, teams should review their estimation accuracy as a group. Pull up completed tasks with their estimates and actuals. Calculate the team's collective multiplier. Discuss which types of tasks are consistently underestimated and why.

These calibration sessions serve double duty: they improve future estimates and they build shared understanding of task complexity. When everyone sees that database migrations consistently take 2.5 times the estimate, the whole team adjusts their expectations going forward.

Estimation Norms

Teams benefit from explicit norms about estimation:

Estimates include testing and review time unless explicitly stated otherwise
Estimates assume normal conditions, not best case or worst case
Uncertainty is communicated as a range, not a single number
Estimates are updated when new information emerges, without stigma
Actual times are tracked without blame or judgment

These norms create psychological safety around estimation, which improves accuracy because people are not pressured to provide unrealistically low numbers.

The Planning Horizon Effect

Estimation accuracy degrades as the time horizon increases. You can estimate today's tasks within 20 percent. Next week's tasks within 40 percent. Next month's tasks within 100 percent. Six months out, estimates are essentially guesses.

This has practical implications for how you plan: create detailed estimates only for the near term. Use progressively rougher estimates as you look further out. Do not waste time creating precise estimates for work that is months away, because the work itself will likely change before you get to it. This principle supports the sprint-based approach where you estimate in detail only for the current sprint and use rough sizing for everything in the backlog.

Evidence-Based Scheduling

Some project management tools and methodologies track estimation accuracy automatically and use historical data to adjust future projections. The tool learns your personal multiplier over time and applies it transparently. This removes the human effort from calibration while continuously improving accuracy. Even without specialized tools, a simple spreadsheet tracking estimates versus actuals by task category provides the same calibration data with minimal overhead.

Key Takeaways

The planning fallacy causes systematic underestimation. Awareness alone does not fix it. You need structured techniques.
Reference class forecasting uses historical data from similar tasks to ground your estimates in reality rather than optimism.
The multiplier method is the simplest improvement: estimate intuitively, then multiply by your historical accuracy ratio.
Task decomposition improves accuracy by forcing you to think through actual steps and surface hidden work.
Tracking estimates vs. actuals is the highest-leverage practice. You cannot improve what you do not measure.

Frequently Asked Questions

How do I estimate tasks I have never done before?

Use three-point estimation and lean heavily on the pessimistic case. For truly novel tasks, your first estimate should be treated as a rough guess. Update it as you learn more about the actual work involved.

Should I add buffer time to every estimate?

Rather than adding arbitrary buffers, use the multiplier method. This provides a principled adjustment based on your actual historical accuracy rather than a random padding amount.

How long does it take to get good at estimating?

Most people see meaningful improvement after tracking 30 to 50 tasks. Significant accuracy gains typically appear within three to six months of consistent practice.

What is a good estimation accuracy target?

Aim to have your actuals within 20 percent of your estimates for at least 70 percent of tasks. Perfect accuracy is neither achievable nor necessary. The goal is to be consistently close enough for reliable planning.

How do I handle tasks that get interrupted or paused?

Track actual working time separately from calendar time. If a 4-hour task gets spread across three days, the effort estimate was 4 hours regardless of the duration. Use this distinction when calibrating your multiplier.

How to Estimate Task Duration Accurately

Why We Are Terrible at Estimating

The Psychology of Estimation Bias

Optimism Bias

Anchoring to Best Case

Neglecting Overhead

The Uniqueness Bias

Social Pressure

Technique 1: Reference Class Forecasting

What It Is

How to Apply It

Example

Technique 2: The Multiplier Method

What It Is

Finding Your Multiplier

How to Apply It

Technique 3: Three-Point Estimation

What It Is

The PERT Formula

Why Three Points Work

Technique 4: Task Decomposition

What It Is

Why It Works

How to Apply It

Tracking Estimates vs. Actuals

Why Tracking Matters

What to Track

Using Your Data

Common Estimation Mistakes

Confusing Effort with Duration

Ignoring Dependencies and Wait Time

Estimating Under Pressure

Round Number Bias

Failing to Account for Quality Standards

Estimation in Team Contexts

Planning Poker

Velocity-Based Estimation

Estimation Agreements

Building an Estimation Practice

Building an Estimation Culture on Teams

Estimation as a Team Practice

Calibration Sessions

Estimation Norms

The Planning Horizon Effect

Evidence-Based Scheduling

Key Takeaways

Frequently Asked Questions

How do I estimate tasks I have never done before?

Should I add buffer time to every estimate?

How long does it take to get good at estimating?

What is a good estimation accuracy target?

How do I handle tasks that get interrupted or paused?

Ready to plan your day with AI?