- Home
- Blog
- Manufacturing KPIs
- Machine Downtime Tracking: Formulas, Benchmarks, a…
Machine Downtime Tracking: Formulas, Benchmarks, and Reduction Strategies

Machine downtime is one of the most expensive problems in manufacturing — and one of the most preventable. Every hour that a production machine sits idle due to an unplanned breakdown costs far more than the repair bill. It costs throughput, late deliveries, overtime to recover, and cascading schedule disruptions that affect dozens of other orders.
Yet many manufacturers do not track downtime systematically. They know when a machine breaks down, they fix it, and they move on — without capturing the data needed to identify patterns, predict failures, and prevent recurrences. This reactive approach ensures that the same problems keep happening, costing the same money, month after month.
Effective machine downtime tracking transforms maintenance from reactive firefighting into proactive prevention. Combined with finite capacity scheduling that accounts for maintenance requirements and equipment reliability, it creates a system where unplanned downtime steadily decreases while machine availability steadily improves. This guide covers the formulas, benchmarks, analysis techniques, and reduction strategies that make this transformation possible. For how downtime tracking fits into your overall metrics program, see our manufacturing KPIs guide.
Machine Downtime Formulas
Machine Availability
Machine Availability (%) = (Scheduled Time - Downtime) / Scheduled Time x 100
Alternatively:
Machine Availability (%) = Actual Running Time / Scheduled Production Time x 100
If a CNC machine is scheduled for 160 hours per month and runs 140 hours:
Availability = (140 / 160) x 100 = 87.5%
Downtime Percentage
Downtime (%) = Total Downtime Hours / Scheduled Production Hours x 100
The complement of availability: if availability is 87.5%, downtime is 12.5%.
Mean Time Between Failures (MTBF)
MTBF = Total Operating Time / Number of Failures
If a machine operates for 800 hours and experiences 4 breakdowns:
MTBF = 800 / 4 = 200 hours between failures
MTBF measures reliability — higher MTBF means more reliable equipment. Track MTBF trends to evaluate whether your maintenance program is improving reliability over time.
Mean Time to Repair (MTTR)
MTTR = Total Repair Time / Number of Repairs
If those 4 breakdowns required a total of 14 hours to repair:
MTTR = 14 / 4 = 3.5 hours per repair
MTTR measures maintenance effectiveness — lower MTTR means faster recovery from breakdowns.
Availability from MTBF and MTTR
Availability = MTBF / (MTBF + MTTR)
Using the example above: 200 / (200 + 3.5) = 98.3%
This formula is useful for predicting availability based on reliability and maintainability data.
Cost of Downtime
Hourly Downtime Cost = Lost Throughput per Hour + Emergency Repair Cost per Hour + Overtime Recovery Cost per Hour
For a constraint resource:
- Lost throughput: $1,200/hour (revenue minus materials for constraint output)
- Average emergency repair: $300/hour (parts, labor, outside services)
- Overtime recovery: $200/hour (premium labor to catch up)
Total constraint downtime cost: $1,700 per hour
For non-constraint resources, lost throughput may be zero if there is excess capacity to absorb the disruption, but repair costs and potential constraint starvation costs still apply.
Machine Downtime Benchmarks
Availability Benchmarks by Equipment Type
| Equipment Type | Typical Availability | World-Class Target |
|---|---|---|
| CNC Machining Centers | 80-90% | 93%+ |
| Injection Molding | 85-92% | 95%+ |
| Welding Equipment | 82-90% | 93%+ |
| Assembly Lines | 88-95% | 97%+ |
| Stamping/Press | 80-88% | 92%+ |
| Packaging Lines | 75-88% | 93%+ |
| Heat Treatment | 85-93% | 96%+ |
| EDM/Wire EDM | 78-88% | 92%+ |
MTBF and MTTR Benchmarks
| Performance Level | MTBF | MTTR |
|---|---|---|
| World-Class | 400+ hours | Below 1 hour |
| Good | 200-400 hours | 1-2 hours |
| Average | 100-200 hours | 2-4 hours |
| Poor | Below 100 hours | Above 4 hours |
Note that MTBF varies enormously by equipment type and age. A new CNC machine should have MTBF above 1,000 hours, while a 20-year-old manual machine might operate reliably at 200 hours MTBF. The key metric is the MTBF trend — declining MTBF signals increasing reliability problems that need attention.
Planned vs. Unplanned Downtime Ratio
World-class maintenance organizations achieve an 80/20 ratio: 80% of total downtime is planned (preventive and predictive maintenance) and only 20% is unplanned. Most manufacturers have the opposite ratio — 70-80% unplanned downtime — indicating reactive maintenance cultures.
Implementing a Downtime Tracking System
Step 1: Define Downtime Categories
Create a standardized set of downtime reason codes. A typical hierarchy:
Planned Downtime:
- Preventive maintenance
- Planned changeover/setup
- Scheduled calibration
- Planned upgrade/modification
Unplanned Downtime:
- Mechanical breakdown
- Electrical/control failure
- Tooling failure
- Material shortage (machine idle)
- Operator unavailable
- Quality stoppage
- Utility failure (power, air, coolant)
Keep the list manageable — 10-15 codes maximum. Too many codes lead to inconsistent categorization and unreliable data.
Step 2: Establish Data Collection
Downtime data collection options range from manual to fully automated:
Manual logging: Operators record downtime events, duration, and reason codes on paper or simple digital forms. Low cost but prone to inaccuracies and delayed recording.
Semi-automated: Machine monitoring sensors detect when equipment stops, and operators select reason codes from a touchscreen. More accurate timing with human-provided context.
Fully automated: IoT sensors, machine PLC integration, and pattern recognition algorithms detect, categorize, and record downtime events automatically. Most accurate but highest investment.
Start with the level your organization can sustain consistently. Incomplete automated data is worth less than complete manual data.
Step 3: Analyze Downtime Data
Once you have 60-90 days of data, analyze it using these techniques:
Pareto analysis by reason code: Which causes account for the most downtime hours? Focus improvement efforts on the top 3-5 causes.
Pareto analysis by machine: Which machines have the worst availability? Prioritize maintenance investment on equipment with the highest downtime and the highest throughput impact.
Time pattern analysis: Does downtime spike on certain shifts, days, or times? Patterns often reveal root causes — downtime after weekends might indicate temperature-related issues, for example.
MTBF trend analysis: Plot MTBF by month for critical machines. Declining MTBF signals impending reliability failure and should trigger proactive maintenance intervention.
Strategies to Reduce Machine Downtime
Strategy 1: Transition from Reactive to Preventive Maintenance
The highest-impact shift most manufacturers can make is implementing a structured preventive maintenance (PM) program. Key elements:
- PM schedules based on manufacturer recommendations and operating data — not gut feeling
- PM task standardization with detailed procedures, required parts, and time estimates
- PM compliance tracking to ensure maintenance is actually performed on schedule
- Spare parts management to ensure critical parts are available when needed
A well-implemented PM program typically reduces unplanned downtime by 30-50% within the first year.
Strategy 2: Integrate Maintenance into Production Scheduling
One of the biggest barriers to PM compliance is production pressure — "we cannot take the machine down for maintenance because we have orders to run." This conflict disappears when maintenance windows are scheduled alongside production orders.
RMDB scheduling software integrates maintenance activities directly into the production schedule. The scheduler accounts for PM windows when promising delivery dates and sequencing production, ensuring that maintenance happens without disrupting deliveries. This eliminates the false choice between maintenance and production.
Strategy 3: Implement Condition-Based Monitoring
Predictive maintenance uses real-time equipment monitoring to detect degradation before failure occurs:
- Vibration analysis detects bearing wear, misalignment, and imbalance
- Oil analysis detects contamination, wear particles, and fluid degradation
- Thermal monitoring detects hot spots indicating electrical or mechanical problems
- Power consumption monitoring detects efficiency losses indicating wear
Condition-based maintenance optimizes the PM schedule — maintenance is performed when the equipment indicates it needs attention, not on a fixed calendar that may be too early (wasting maintenance resources) or too late (after the failure has begun).
Strategy 4: Reduce Mean Time to Repair
When breakdowns do occur, faster repair minimizes the production impact:
- Pre-staged spare parts for common failure modes eliminate procurement delays
- Standardized diagnostic procedures help maintenance technicians identify problems faster
- Cross-training ensures multiple technicians can work on critical equipment
- Maintenance information systems provide repair history, parts lists, and procedures at point of use
- Vendor support agreements provide rapid response for complex repairs
Strategy 5: Address Root Causes, Not Symptoms
Many maintenance organizations repeatedly repair the same failures without asking why the failure keeps occurring. Implement root cause analysis for every significant downtime event:
- What failed?
- Why did it fail?
- What conditions led to the failure?
- How can we prevent recurrence?
- What early warning signs should we monitor?
Document findings and corrective actions. Track recurrence rates to verify that root cause corrections are effective.
Strategy 6: Reduce Changeover-Related Downtime
Changeover time — the time between the last good piece of one job and the first good piece of the next — is technically planned downtime but often exceeds planned duration. Changeover time reduction through SMED methodology can cut changeover times by 40-60%, and scheduling software minimizes the number of changeovers needed through intelligent job sequencing.
Downtime's Impact on Other Manufacturing KPIs
Machine downtime creates cascading effects across your KPI dashboard:
On-time delivery: Unplanned downtime at constraint resources directly delays orders. Even non-constraint downtime can cause delays if it starves the bottleneck.
Throughput rate: Every hour of constraint downtime is an hour of lost factory throughput. For a constraint with $1,200/hour throughput, 50 hours of annual unplanned downtime costs $60,000 in lost throughput.
Manufacturing cycle time: Downtime increases queue times as work backs up behind the failed machine. One hour of downtime can add days of cycle time to jobs in the queue.
Schedule adherence: Unplanned downtime is the most common cause of schedule adherence failures. No schedule can survive frequent unpredictable machine failures.
WIP inventory: Downtime causes WIP accumulation upstream of the failed machine and starvation downstream, creating imbalanced flow.
Cost per unit: Downtime reduces output while fixed costs remain constant, increasing the overhead allocated to each unit produced.
Downtime Tracking Metrics Dashboard
Monitor these metrics to drive continuous downtime reduction:
| Metric | Frequency | Target |
|---|---|---|
| Overall machine availability | Daily | Above 90% |
| Unplanned downtime hours | Weekly | Trending down |
| Planned/unplanned ratio | Monthly | Moving toward 80/20 |
| MTBF by critical machine | Monthly | Trending up |
| MTTR by critical machine | Monthly | Trending down |
| PM compliance rate | Weekly | Above 95% |
| Top 5 downtime causes | Monthly | Declining |
| Constraint availability | Daily | Above 95% |
| Downtime cost ($) | Monthly | Trending down |
Display constraint machine availability prominently — it has the largest impact on factory performance and should receive the most management attention.
Take Control of Your Equipment Reliability
Machine downtime is not a random, unavoidable event — it is a manageable, reducible cost. The combination of systematic tracking, preventive maintenance, scheduling integration, and root cause analysis consistently reduces unplanned downtime by 30-50% within the first year.
User Solutions helps manufacturers integrate equipment reliability into their scheduling and planning processes through RMDB scheduling software that accounts for maintenance requirements, and EDGEBI analytics that provide the downtime visibility needed to drive continuous improvement. Our approach ensures that maintenance supports production rather than competing with it.
Request a demo to see how RMDB scheduling integrates maintenance planning with production scheduling for maximum equipment availability and on-time delivery.
Expert Q&A: Deep Dive
Q: How should manufacturers prioritize which machines to track for downtime?
A: Start with your constraint resources — the bottleneck machines where downtime directly reduces factory throughput. Every hour of downtime at the constraint is an hour of lost throughput for the entire plant. Next, track machines with the highest downtime history or the most expensive repair costs. You do not need to implement comprehensive tracking on every machine simultaneously. We recommend starting with 5-10 critical machines, proving the value of tracking, and expanding from there. RMDB scheduling identifies your constraint resources automatically, making it clear where downtime tracking delivers the highest ROI.
Q: What is the true cost of one hour of unplanned downtime?
A: For a constraint resource, the cost is: Throughput Dollars per Hour + Emergency Repair Cost + Overtime to Recover + Schedule Disruption Cost. For a typical mid-size manufacturer, one hour of constraint downtime costs $500-$2,000 in throughput alone, plus $200-$500 in repair costs and potentially thousands more in cascading schedule disruptions. Non-constraint downtime costs less directly but still incurs repair costs and can cause constraint starvation if the downed machine feeds the bottleneck.
Q: How do you justify preventive maintenance when it takes machines offline during production time?
A: Frame it as an investment, not a cost. One hour of planned preventive maintenance prevents an average of 3-8 hours of unplanned breakdown time, based on industry reliability data. Planned maintenance can be scheduled during low-demand periods and coordinated with changeovers. Unplanned breakdowns happen at the worst possible time — when the machine is running production. RMDB scheduling integrates maintenance windows into the production schedule, ensuring preventive maintenance happens without disrupting delivery commitments.
Frequently Asked Questions
Ready to Transform Your Production Scheduling?
User Solutions has been helping manufacturers optimize their production schedules for over 35 years. One-time license, 5-day implementation.

User Solutions Team
Manufacturing Software Experts
User Solutions has been developing production planning and scheduling software for manufacturers since 1991. Our team combines 35+ years of manufacturing software expertise with deep industry knowledge to help factories optimize their operations.
Share this article
Related Articles

Capacity Utilization KPI: Formula, Benchmarks, and Optimization Strategies
Calculate manufacturing capacity utilization accurately, compare benchmarks by industry, and learn why optimizing utilization — not maximizing it — drives the best manufacturing outcomes.

Changeover Time Reduction: SMED Guide with Formulas and Strategies
Reduce manufacturing changeover times by 40-60% using SMED methodology. Includes changeover time formulas, industry benchmarks, step-by-step implementation, and scheduling optimization techniques.

Cost Per Unit in Manufacturing: Calculation, Benchmarks, and Reduction Strategies
Calculate manufacturing cost per unit accurately with formulas for material, labor, and overhead allocation. Includes benchmarks and strategies to reduce unit costs through scheduling optimization.
