What Is an SLA (Service Level Agreement) in IT?
An SLA — Service Level Agreement — is a contractual commitment between an IT provider and a client that defines the minimum service standards: response times for different issue priorities, uptime guarantees, escalation procedures, and remedies when SLAs are not met. In managed IT services, the SLA is the foundation of the client-provider relationship — it defines exactly how quickly the MSP will respond when you have a problem, and what happens if they don't.
IT SLA tiers: how response times are structured
Well-structured IT SLAs define at least three priority levels:
- Critical (P1) — complete system outage, ransomware attack, server down, all users affected; typical response: 15-30 minutes; typical resolution target: 2-4 hours
- High (P2) — significant function unavailable, major application down, multiple users affected; typical response: 1-2 hours; typical resolution target: 4-8 hours
- Standard (P3) — single user affected, workstation issue, non-urgent software problem; typical response: 4 hours; typical resolution target: next business day
- Low (P4) — information requests, minor inconveniences, non-urgent questions; typical response: next business day; typical resolution target: within 3 business days
What good SLAs include (and what to watch out for)
What a good IT SLA includes:
- Clear priority definitions with objective criteria (not just "urgent" vs. "normal")
- 24/7 coverage for P1 and P2 issues — not just business hours
- Response time vs. resolution time clearly distinguished — a response in 15 minutes doesn't mean your issue is fixed in 15 minutes
- Uptime SLAs for hosted services (e.g., 99.9% uptime = 8.76 hours downtime/year maximum)
- Remedies for SLA breaches — service credits or other remedies if response times are missed
- Exclusions clearly stated — what falls outside the SLA (user-caused issues, third-party outages, etc.)
SLA red flags to avoid:
- Response times measured from when a ticket is assigned (not from when you submit it)
- Business-hours-only P1 response — ransomware doesn't wait until Monday morning
- No defined remedies for SLA breaches
- Vague language: "best efforts," "reasonable time," without defined metrics
Outsource IT Canada SLA tiers
- P1 — Critical: 15-minute response, 24/7/365 — system down, ransomware, business-halting outage
- P2 — High: 1-hour response, business hours — major function unavailable, multiple users affected
- P3 — Standard: 4-hour response, business hours — single user issue, non-urgent software problem
- P4 — Low: next business day — information requests, minor issues
SLA vs. OLA (Operating Level Agreement)
- SLA: agreement between IT provider and end client; defines client-facing service commitments
- OLA: agreement between internal teams within an MSP; defines how internal departments support each other to meet SLA commitments
- UC (Underpinning Contract): agreement between MSP and third-party vendors (ISPs, hardware suppliers) that support SLA delivery
Related glossary terms
- MSP — Managed Service Provider
- RTO — Recovery Time Objective
- RPO — Recovery Point Objective
- BDR — Backup and Disaster Recovery
- RMM — Remote Monitoring and Management
How Outsource IT Canada can help
- Managed IT Services — 24/7 monitoring and flat-rate IT support for Canadian businesses
- Cybersecurity Services — EDR, MDR, dark web monitoring, and incident response
- PIPEDA Compliance — privacy impact assessments and breach notification procedures
- Get a free assessment — call (416) 623-9677
Ready to transform your IT? Call (416) 623-9677 for a free assessment.