Marketing often creates them, sales makes the promise, and operations has to deliver them.
Service Level Agreements or SLAs are at the heart of Ethernet business services. They continue to gain traction in the market as the competitive differentiator for many service providers looking to out “market” and perform their competition.
- How do you manage the network and service to an SLA?
- How do you interpret SLA jargon into operational process and procedure?
- How do you monitor and measure the network to guarantee five nines service availability, reduce MTTR, response time, latency, jitter and packet loss?
- How do you guarantee installation intervals?
- More importantly how do you preempt SLA violations to avoid penalties and service credits?
Download the whitepaper and see how real-world Service Level Agreements can be translated into actionable business processes and set the operational requirements needed to successfully manage SLA backed business services.
Need more information? Start a conversation here: TestCenterLIVE@spirent.com
Service Provider TacTicS To enSure
SLa-backed eTherneT ServiceS
April 2012
Rev. A 04/12
SPIRENT
1325 Borregas Avenue
Sunnyvale, CA 94089 USA
Email: sales@spirent.com
Web: www.spirent.com
AmERIcAS 1-800-SPIRENT • +1-818-676-2683 • sales@spirent.com
EuRoPE ANd ThE mIddlE EAST +44 (0) 1293 767979 • emeainfo@spirent.com
ASIA ANd ThE PAcIfIc +86-10-8518-2539 • salesasia@spirent.com
© 2012 Spirent. All Rights Reserved.
All of the company names and/or brand names and/or product names referred to in this document, in particular,
the name “Spirent” and its logo device, are either registered trademarks or trademarks of Spirent plc and its
subsidiaries, pending registration in accordance with relevant national laws. All other registered trademarks or
trademarks are the property of their respective owners.
The information contained in this document is subject to change without notice and does not represent a
commitment on the part of Spirent. The information in this document is believed to be accurate and reliable;
however, Spirent assumes no responsibility or liability for any errors or inaccuracies that may appear in the
document.
Service Provider Tactics to Ensure SLA-backed
Ethernet Services
Spirent white paper • i
Why MSOs are Aggressively Offering Ethernet Services . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .1
The Applications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .2
Mobile Backhaul . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .2
Business Services . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .3
The Challenge . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .4
Ethernet SLAs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .5
Vertical markets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .5
A Comparison of Typical SLAs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .6
Breaking Down the Ethernet SLA: Defining Operation Milestones . . . . . . . . . . . . . . . . . . . . . .7
Turn Up . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .7
Performance Management and Key Performance Indicators . . . . . . . . . . . . . . . . . . . . . . . . . . .8
Trouble Maintenance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .8
Penalties and Credits . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .9
Moving into the Big Leagues with an SLA . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .9
Traffic Diversity and Class of Service Requirements . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .10
Translating SLAs into Operational Requirements . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .11
Turn Up . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .11
Services . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .11
The Current Turn Up Process . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .12
Service Validation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .13
Optimizing Turn Up . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .13
Performance Management . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .17
Tracking KPIs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .17
Active Monitoring . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .18
PM as a Management Tool . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .19
PM as Triage . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .20
Maintenance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .20
Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .23
Service Provider Tactics to Ensure SLA-backed Ethernet Services
1 • SPIREnT WhITE PAPER
Why SERVICE PROVIDERS ARE AggRESSIVELy OffERIng EThERnET SERVICES
The proliferation of Ethernet has opened up several new markets to service
providers .
The trends have been clear for some time—the growth of IP and mobile real-
time traffic, low-cost Ethernet ports, the revenue shift from voice to data, and
the packet video explosion . Everything points to Ethernet as the transport
technology of today and into the future .
According to frost & Sullivan
1
, the retail US Carrier Ethernet services market
continued to keep up its 2009 momentum through 2010, growing at an
impressive annual growth rate of 40 percent . Worldwide Business Ethernet
Services market will grow to $US 40 .2 billion by 2014, according to
Vertical Systems
2
.
A yankee group
3
report indicates that the market for wholesale backhaul
services in north America will grow from $US 2 .45 billion in 2010 to $US 3 .9
billion in 2015, with the majority of this growth coming from Ethernet backhaul
4
.
new players to the business services market are not watching from the
sidelines . In 2007, Cox Business became the first MSO to reach the top tier of
U .S . business Ethernet providers . Time Warner Cable’s “Business Class” was
recognized as the Metro Ethernet forum’s (MEf) 2008 Ethernet Service Provider
of the year . In 2009 Comcast moved beyond providing voice and data services
over a traditional DOCSIS-based coax network for small and medium sized
business (SMB) to serving mobile backhaul on Ethernet, and more recently
offering Metro Ethernet services to medium-sized businesses . .
1 frost & Sullivan: U.S. Retail Carrier Ethernet Services Market Update, 2011, August 2011 .
2 fierce Telecom: eBook Ethernet Exchanges Make the Interconnection, December 2009
http://www .verticalsystems .com/download/Ethernet%20ebook .pdf
3 yankee group: Wholesale Mobile Backhaul: There’s Gold in Them There Hauls, yankeegroup .com, June 2011
4 yankee group: Wholesale Mobile Backhaul: There’s Gold in Them There Hauls, yankeegroup .com, June 2011
Service Provider Tactics to Ensure SLA-backed Ethernet Services
SPIREnT WhITE PAPER • 2
Figure 1. Infonetics Research, Mobile Backhaul Equipment and Services, October 2010
ThE APPLICATIOnS
There are several markets where Ethernet plays a critical role and where service
providers can provide solutions and generate revenue, which include mobile
backhaul and business services .
Mobile Backhaul
Legacy mobile backhaul is one of the biggest contributors to the high cost of
delivering real-time content . Time division multiplexing (TDM) circuits with their
fixed bit rates do not scale easily in response to variations in demand and are
six times as expensive to install and maintain as the alternative, Ethernet .
however, legacy TDM backhaul service has several features not found
natively in Ethernet, such as path-level monitoring and fault detection, simple
provisioning, fast protection switching, and robust timing and synchronization .
for mobile operators to be able to exploit the affordability of Ethernet in the
mobile backhaul network, it must support these capabilities . Therefore, service
providers that sell Ethernet mobile backhaul must be able to deliver on these
expectations to be competitive .
Service Provider Tactics to Ensure SLA-backed Ethernet Services
3 • SPIREnT WhITE PAPER
According to an Ovum
5
report, all backhaul connections will be carried over
Ethernet by 2015 . Today, only 10 percent of backhaul connects are Ethernet .
Over 100 network operators are actively deploying a single IP-Ethernet backhaul
to carry all types of traffic, including voice and data, according to Infonetics
Research
6
. This exciting market has attracted many service providers as the
competition increases .
Business Services
Delivering business services using Carrier Ethernet solutions is generally broken
into three basic architectures designed to enterprise’s network requirements:
• Point-to-Point Service (E-LInE)
• Multipoint Service (E-LAn)
• Point to Multi-point broadcast (E-TREE)
Figure 2. Metro Ethernet Services, (Source Metro Ethernet Forum)
5 Ovum: Mobile Backhaul Forecast (Global), 2010
6 Infonetics Research: Mobile Backhaul Equipment and Services: Biannual Worldwide and Regional Market Share, Size, and Forecasts,
2nd Edition, September 29, 2011
Service Provider Tactics to Ensure SLA-backed Ethernet Services
SPIREnT WhITE PAPER • 4
Corporate customers have specific requirements for service activation times,
availability, and performance . Providers deliver this through various methods,
including granular bandwidth allocation and tiers of service with performance
guarantees, such as:
• Premium service for real-time IP telephony or IP video applications
• Silver service for bursty mission-critical data applications requiring low
loss and delay, such as storage networks
• Bronze service for bursty data applications requiring bandwidth
assurances, such as video
• Standard service for best-effort applications such as email and
web browsing
Traditionally, incumbent Telco service providers have owned this market in the
past . So new providers offering Metro Ethernet services must be able to deliver
similar competitive service levels
ThE ChALLEngE
facing a competitive landscape is not an easy task . In a competitive sales
situation, customers often go with an incumbent carrier because nobody has
ever been fired for going with what seems like a sure thing, whereas trusting a
provider new to the market to deliver carrier-grade service over Ethernet can be
seen as a risky move .
As a result, simply marketing an Ethernet-based service as an alternative to a
traditional carrier is not enough . The competitive differentiator is the ability to
deliver services that comply with a well-defined service level agreement (SLA) .
Delivering on an SLA with penalties and credits for violations is normal business
for incumbent carriers, and new territory for others . however, it is a minimum
requirement for entering the medium and large business services market
Figure 3. Structure of a business service SLA
SERVICE
ACTIVATION
• Critical turn up dates with penalties
• One or more offices
• Quality of Service validated
PERFORMANCE
MONITORING
• 24x7 “always on” quality
• Availability, Loss, Latency
• Performance guarantees
TROUBLE
MANAGEMENT
• 24x7 call center
• Mean Time to Repair
• Multiple end points
Service Provider Tactics to Ensure SLA-backed Ethernet Services
5 • SPIREnT WhITE PAPER
With mobile carriers turning up dozens of cell sites per week, quick turnaround
on turn up is essential . The ability of a service provider to deliver connections
validated for the required quality of service (QoS) performance requirements at
multiple locations quickly and efficiently is the first step, followed by managing
a production network to those performance requirements to further differentiate
their offering .
To efficiently compete, new Ethernet providers must make fundamental changes .
The first step is to develop competitive service backed by SLAs . But, to deliver
these services, the SLAs must be translated into operational requirements .
EThERnET SLAs
When moving from TDM to Ethernet SLAs, the key performance indicators (KPIs)
change from bit-level metrics to packet-level metrics . In addition, the metrics of
interest vary by market
Vertical markets
According to the Metro Ethernet forum, popularity of Layer 2 Carrier Ethernet
services tends to vary across industry market sectors, with finance, healthcare,
and retail among the biggest users of Ethernet . for finance, the focus is on
ensuring high availability and near-zero latency so that information can move
between branches, exchanges and data centers with millisecond accuracy .
Within healthcare, the focus is on medical imaging and electronic health
record applications, which drive significant bandwidth requirements and high
availability . Retailers are focused on driving real-time, actionable information,
which drives the need to support higher capacity, connection resiliency, and
low latency .
Popularity of Carrier Ethernet in Industry Market Sectors
(Metro Ethernet Forum)
40
30
20
10
0
P
er
c
ent
of
C
E
Ser
v
ic
e
s
Finance
Real Estate
Medical
Education
Data Center
Legal
Retail
Media
Figure 4. Ethernet Services Markets (Source Metro Ethernet Forum)
Service Provider Tactics to Ensure SLA-backed Ethernet Services
SPIREnT WhITE PAPER • 6
A Comparison of Typical SLAs
In the current market, brand is no longer seen as a guarantee of performance
and value . All aspects of a service—from activation intervals to latency and
loss guarantees—are evaluated and compared when choosing a vendor .
Deadlines are tight, time to revenue is critical, and performance is a competitive
differentiator .
Table 1 shows SLAs currently offered by four anonymous carriers . Many of the
details are fairly standard across all carriers, such as 24 x 7 support and credits
for certain violations set at 1/30 of monthly recurring cost (MRC) or two percent
of MRC per hour . however, other parameters have a wide range of values . for
example, the greatest latency guarantee (25 ms) is two-and-a-half times as long
as the least (10 ms) . Service activation intervals range from seven days to over
six times as long, at 45 days .
Table 1. Four SLAs compared
CriTeriA SLA 1 SLA 2 SLA 3 SLA 4
Service Activation
Interval
Negotiated 45 days on-net,
negotiated off-net
7 days 14 work days
Activation Penalty 1/30 of MRC
per day
5% of MRC
per day
Negotiated 5% of MRC
per day
24 x 7 call center YES YES YES YES
Mean Time to Repair
(MTTR)
4 hours 4 hours 4 hours 2 hours on-net,
4 hours off-net
MTTR Penalty 1/30 of MRC
per hour
Every hour over is
5% of MRC
1/30 of MRC
per hour
2% of MRC
per hour over
Bandwidth (CIR)
Credit
Negotiate 1/30 of MRC
per day <CIR
1/3 MRC per day Negotiated
Bandwidth Ordered CIR Ordered CIR Ordered CIR Ordered CIR
BER N/A N/A N/A 1 x 10** —9
Loss 0.5% Avg. Negotiated N/A 0.2% Avg.
Loss Penalty 1/30 of MRC
per 1%
Negotiated N/A 1/30 of MRC
per %
Latency 25 ms Negotiated 15 ms 10 ms
Latency Penalty 1/30 of MRC
per 1 ms
Negotiated N/A 1/30 of MRC
per 1 ms
Availability 99.95% Negotiated 99.99% 95%
Availability Penalty 1/30 MRC
per hour
<4 hour = 5%,
up to 50%
1 hr = 2%
5% after first hour
2% of MRC
per hour
Service Provider Tactics to Ensure SLA-backed Ethernet Services
7 • SPIREnT WhITE PAPER
Table 2. Turn-up focused SLA requirements
CriTeriA SLA 1 SLA 2 SLA 3 SLA 4
Service Activation
Interval
Negotiated 45 days on-net,
negotiated off-net
7 days 14 work days
Activation Penalty 1/30 of MRC
per day
5% of MRC
per day
Negotiated 5% of MRC per day
Bandwidth Ordered CIR Ordered CIR Ordered CIR Ordered CIR
Loss 0.5% Avg. Negotiated N/A 0.2% Avg.
Latency 25 ms Negotiated 15 ms 10 ms
An SLA for a legacy TDM service specifies KPIs such as Errored Seconds,
Severely Errored Seconds, and Unavailable Seconds based on CRC detected
(bit) errors . however, Ethernet does not have the built-in timing of a
synchronous connection .
Although some current SLAs do include a TDM set of performance
measurements, such as SLA 4 in Table 1 which establishes a bit error rate
guarantee for Layer 1 transport, most SLAs based on carrier Ethernet focus
on availability along with packet loss and latency measurements .
Breaking Down the ethernet SLA: Defining Operation Milestones
An SLA is a compilation of requirements across three main categories: turn up,
performance management, and maintenance .
Turn Up
When will the service be ready to use? This seems like a straightforward
question, but it can pose significant logistical challenges . for a single business
service, such as a point-to-point Ethernet E-Line service, turn up is the promised
date of operation .
Typically, timing is critical in that traffic must be handed over between two
services when one service, such as a leased line, is turned off and another, such
as an E-Line, is turned up . The specific application may allow an off-hours cut
over during a planned maintenance window, or it may require a zero-down-time
live cut over on a production network .
Other applications, such as mobile backhaul services, may require a seamless
cut over of dozens of sites at a specific time on specific dates . A single Ethernet
services contract could have hundreds of sites . hitting critical dates may be
associated with liquidated damages for site failures . Validating multiple sites is
essential, as the SLA metrics will be both time and performance sensitive, and
can specify either a date or a time interval for turn up .
Service Provider Tactics to Ensure SLA-backed Ethernet Services
8 • SPIREnT WhITE PAPER
Table 3. Performance Management focused SLA requirements
CriTeriA SLA 1 SLA 2 SLA 3 SLA 4
Loss 0.5% Avg. Negotiated N/A 0.2% Avg.
Loss Penalty 1/30 of MRC
per 1%
Negotiated N/A 1/30 of MRC
per %
Latency 25 ms Negotiated 15 ms 10 ms
Latency Penalty 1/30 of MRC
per 1 ms
Negotiated N/A 1/30 of MRC
per 1 ms
Availability 99.95% Negotiated 99.99% 95%
Availability
Penalty
1/30 MRC
per hour
<4 hour = 5%,
up to 50%
1 hr = 2%
5% after first hour
2% of MRC
per hour
Table 4. Trouble Maintenance focused SLA requirements
CriTeriA SLA 1 SLA 2 SLA 3 SLA 4
24 x 7 call center YES YES YES YES
Mean Time to
Repair (MTTR)
4 hours 4 hours 4 hours 2 hours on-net,
4 hours off-net
MTTR Penalty 1/3 of MRC
per hour
Every hour over is
5% of MRC
1/30 of MRC
per hour
2% of MRC
per hour over
Bandwidth (CIR)
Credit
Negotiate 1/30 of MRC
per day <CIR
1/3 MRC per day Negotiated
Trouble Maintenance
Once a service is up and running, there is a 100% chance that at some time
the service or the customer will have a problem . Mean time to repair or resolve
(MTTR) is the grandfather of every SLA KPI . When something goes wrong, how
long does it take to fix it? The more critical the service or application, the shorter
the MTTR requirement from the enterprise .
Performance Management and Key Performance indicators
Performance management (PM) covers a broad range of KPIs . All SLAs have a
provision that mandates 24x7 monitoring combined with a 24x7 support line
to log service outages and troubles . This requirement is in stark contrast to
residential service, and even the level of support required by small business
services .
The ability to offer always-on performance monitoring opens the door to a
premium level of service for financial and international businesses, which rely
on quality network performance including, always-on availability with low loss,
latency, and jitter performance .
Service Provider Tactics to Ensure SLA-backed Ethernet Services
SPIREnT WhITE PAPER • 9
Table 5. Penalties and Credits focused SLA requirements
CriTeriA SLA 1 SLA 2 SLA 3 SLA 4
Activation Penalty 1/30 of MRC
per day
5% of MRC
per day
Negotiated 5% of MRC
per day
Loss Penalty 1/30 of MRC
per 1%
Negotiated N/A 1/30 of MRC
per %
Latency Penalty 1/30 of MRC
per 1 ms
Negotiated N/A 1/30 of MRC
per 1 ms
Availability Penalty 1/30 MRC
per hour
<4 hour = 5%,
up to 50%
1 hr = 2%
5% after first hour
2% of MRC
per hour
Bandwidth Penalty Negotiate 1/30 of MRC
per day <CIR
1/30 of MRC
per day
Negotiated
Moving into the Big Leagues with an SLA
Providers traditionally focused on residential and/or SMBs will have to make
changes to meet the requirements of medium-to-large business services,
which require:
• Time sensitive, rigid, penalty-based, 24 x 7 service management,
monitoring and field dispatch
• Coordination of technician dispatch, historical trending of service issues,
and reporting to the customer for multiple service endpoints
• Increased visibility into the network in both wholesale and retail
service markets
• The ability to segment service to assign responsibility for repairs
• The ability to span multiple systems across MSO and Telco providers
Penalties and Credits
SLAs offer assurances to the customer of a defined level of performance and
compensation if the service level is not met . Penalties and credits are the
teeth of the agreement, poised to bite the provider when the service delivered
falls below the guarantees . An SLA can define blanket penalties for service
disruption or penalties for specific SLA violations . Additionally, penalties are
used as competitive differentiators . Customers look to the credits and penalties
as a measure of the confidence the service provider has in its product . The
higher or more stringent the penalty or credit, the greater the likelihood that the
provider will have very few service disruptions .
Service Provider Tactics to Ensure SLA-backed Ethernet Services
10 • SPIREnT WhITE PAPER
Table 6. example CoS-based Metro ethernet SLA from the MeF
CLASS ChArACTeriSTiCS CoS iD
BAnDwiDTh
PrOFiLe Per eVC
Per CoS iD
SerViCe
PerFOrMAnCe
Premium Real-time IP telephony or
IP video applications
6, 7 CIR > 0
EIR = 0
Delay < 5ms
Jitter <1ms
Loss <0.001%
Silver Bursty mission-critical
data applications
requiring low loss
and delay
4, 5 CIR > 0
EIR <=UNI Speed
Delay <5ms
Jitter <N/S
Loss <0.01%
Bronze Bursty data applications
requiring bandwidth
assurances
3, 4` CIR > 0
EIR <= UNI Speed
Delay <15ms
Jitter <N/S
Loss <0.1%
Standard Best Effort Service 0, 1, 2 CIR > 0
EIR <= UNI Speed
Delay <30ms
Jitter <N/S
Loss <0.5%
Traffic Diversity and Class of Service requirements
Class of service (CoS), the assignment of service performance levels within an
SLA, places an even greater demand on operations . Application and service
specific QoS levels dictate CoS capabilities to match the QoS requirements .
In each of the sample SLAs, there is one set of KPIs—loss and latency . As a
result, these SLAs only provide guarantees on a single class of service . Traffic
diversity introduces the need for multiple classes of service . SLAs must support
graduated classes of service . Providers who can respond with these levels of
guarantees will remain competitive in the always-on, always connected world .
Standard bodies such as the International Telecommunications Union (ITU)
and the Metro Ethernet forum (MEf) are at the forefront of defining KPI values
for multiple classes of service for known applications . Vanilla, one-size-fits-all
performance metrics no longer meet the needs of subscribers .
Service Provider Tactics to Ensure SLA-backed Ethernet Services
SPIREnT WhITE PAPER • 11
TRAnSLATIng SLAs InTO OPERATIOnAL REQUIREMEnTS
for an provider to deliver service to an SLA, the guarantees of the SLA must be
translated into specific operational practices . By organizing SLA requirements
into the three main categories of turn up, PM and maintenance, providers can
develop best practices and achieve efficiency and service quality .
Turn Up
An SLA specifies turn up times, but turn up is more than just activating a link
or service . Turn up testing validates that the service meets the QoS specified in
the SLA from the beginning . no longer is best effort service sufficient . Validation
based on whether the “green light” is on, or whether packets are sent and
received is not acceptable . furthermore, documentation is required (e .g ., a birth
certificate) that the installed service meets the SLA .
for example, SLA 2 in Table 2 has an installation interval declaration (the time
from ordering a circuit to its availability) of 45 days for on-network locations,
leaving the installation interval negotiable for off-network locations .
Services
There are generally two focus areas in the Ethernet service portfolio for a
wireline service provider:
1 . Wholesale service for delivering Ethernet connections as part of an
end-to-end service for another service provider
2 . Retail service for delivering business-class service to individual
enterprises
Examples of wholesale service include mobile backhaul connections for wireless
service providers from cell towers to the core packet network, or one service
provider buying connections from another provider to office locations of an
enterprise where they cannot provide the total service themselves .
Most of the SLA examples in Table 1 focus on retail service . The provider has
complete ownership of the network and generally turns up a small number of
office locations (1–10) . In the case of a wholesale service, one order may include
a large number of connections (hundreds) that must be turned up .
Some SLAs state an exact turn up timetable per location, while others specify a
timetable per service . for example, a private line connection will typically state
an exact date, while a mobile backhaul service might be lumped into groups of
sites prioritized by the customer .
Service Provider Tactics to Ensure SLA-backed Ethernet Services
12 • SPIREnT WhITE PAPER
Figure 5. Today’s typical approach for service turn-up is field technician based
The Current Turn Up Process
Today’s service providers use field technicians at both ends of the service and in
the network offices to turn up and validate the service as shown in figure 5 . Key
to this strategy is:
• Coordinated field dispatch to offices or cell towers to confirm the service
is functional
• Validate QoS to match the SLA
The Achilles’ heel of this approach is that scaling the service is directly
proportional to how many field technicians can be hired, retained, and properly
coordinated . As service providers scale their service, they begin to realize that
this approach does not scale; can’t coordinate, can’t hire, and can’t support the
time issues induced by a service validation failure . Can the install be completed
on time to avoid the penalty?
Service Provider Tactics to Ensure SLA-backed Ethernet Services
SPIREnT WhITE PAPER • 13
Service Validation
While construction, equipment allocation and installation have the lion’s share
of impact on turn up, once the construction is completed a separate dispatch,
usually requiring multiple resources, is required to validate the new service . This
step adds time and expense and requires documentation, for example, the birth
certificate establishing that the service meets the SLA at the time up turn up .
field hand held tools are not capable of producing this documentation .
Streamlining the turn up process to reduce the impact to field service teams has
traditionally been an operational challenge . however, some providers are finding
a way to reduce the cost and time required for turn up through a model that
scales while providing SLA compliance through a remote probe-based solution .
In 2010, the MEf named Verizon the global Service Provider of the year for its
innovation in developing and delivering robust wholesale service offerings that
extend the reach of Ethernet services, including mobile backhaul and business
Ethernet services . They also awarded AT&T the Best Business Ethernet Service
Award for providing the most innovative business Ethernet service
for enterprise .
Optimizing Turn Up
Cost-effective and efficient tactics are a must to maintain a competitive
advantage and reduce the impact of accelerated turn up schedules on field
technicians . Remote testing at the time of installation minimizes post
turn-up dispatches .
In mobile backhaul, contracts are contingent on service turn up intervals . It is
not unheard of for a wireless carrier to turn up hundreds of sites a month and
many rely on a wireline service provider to furnish the tower backhaul . These
aggressive turn up schedules are passed along to the provider that can be a
competitive factor, particularly when it comes to the turn up intervals .
Typically, during turn up two field technicians are dispatched—one to the tower
and the other to the provider edge—and both are equipped with handheld
testers to run an RfC 2544 test on each class of service at each frame size .
This approach as shown in figure 5 requires a coordinated effort with two
technicians who must be available at the same time . Many service providers are
moving toward an alternative approach, which includes:
• Installing remote controlled service assurance test probes in the network
to enable testing remotely from the network operations center (nOC)
without a dispatch to the network
• Installing service assurance network-terminating equipment such as an
ALU 7210, Cisco 2941, or other network interface device, which can be
controlled from the nOC and providing a loopback function
Service Provider Tactics to Ensure SLA-backed Ethernet Services
14 • SPIREnT WhITE PAPER
The combination of this solution shown in figure 6 allows connectivity testing
and validation of the QoS to meet the SLA requirements without coordinated
dispatches to all end points of the service, as shown in figure 5 . This automated
approach saves four to five hours of drive time, set up time, and test time . By
centralizing and automating, a provider can significantly reduce the effort and
time it takes to run the test, capture results, and store them for SLA reporting
requirements .
Figure 6. Optimizing service activation via remote controlled Service Assurance devices
NTE with
Loopback
NTE with
Loopback
NTE with
Loopback
NTE with
Loopback
NTE with
Loopback
NTE with
Loopback
NTE with
Loopback
NTE with
Loopback
Service Assurance Probes
Up to 80 percent of the issues with service turn up are related to network
configuration problems, such as MTU size, CIR/EIR set incorrectly, specifying
the wrong Ethernet profiles to support single or multiple classes of service,
and similar issues . During each turn up it is critical to individually validate the
configuration of each of the classes of service . But, testing each CoS individually
is not enough . They must also be validated while running simultaneously
to determine the impact of services on each other . The ITU has recently
recommended a test strategy that does this called y .1564 . ITU-T y .1564 is a
two-step test:
1 . A service configuration test, which validates the network configuration
of each defined service (rate limit, traffic shaping and QoS) . It tests such
details as MTU, bandwidth (CIR), CoS prioritization .
2 . A service performance test, which validates the QoS of each defined
service and proves SLA conformance . It reports such metrics as frame
delay, frame delay variation, frame loss ratio, and availability .
Service Provider Tactics to Ensure SLA-backed Ethernet Services
15 • SPIREnT WhITE PAPER
figure 7 below shows an example of a RfC 2544 test run on a single class of
service, and the ITU-T y .1564 service performance test with four individual
classes of service streams run in parallel, reporting the integrated QoS .
Figure 7. Current RFC 2544 test and the new ITU Y.1564 service performance test examples
RFC 2544 Test
ITU Y.1564 New Standard
Unlike the typical residential service that doesn’t provide SLA guarantees, large
business customers require proof of conformance documentation to verify
that they are receiving the service for which they contracted . Supplying this
proof requires a system that maintains a history of service issues and calls and
supports the ability to document the issues and their resolution . This ability
is especially important for PM and maintenance, but it begins with a birth
certificate record showing the date, time, and performance of the service at
turn up .
Service Provider Tactics to Ensure SLA-backed Ethernet Services
SPIREnT WhITE PAPER • 16
Figure 8. Optimizing service activation via remote controlled Service Assurance devices
NTE with
Loopback
NTE with
Loopback
NTE with
Loopback
NTE with
Loopback
NTE with
Loopback
NTE with
Loopback
NTE with
Loopback
NTE with
Loopback
Service Assurance Probes
The ability to ramp-up these Ethernet connections for mobile backhaul or
business services correctly the first time is a challenge that requires the MSO
to find a solution that is scalable and sustainable . Some organizations have
responded by throwing people at the problem . But how many more people can
be hired? how can management effectively coordinate them for dispatches to
dozens of locations for thousands of circuits? how can they be educated and
retained? In reality, it becomes a business problem—how can the service be
grown, thus revenues, faster than the staff are grown to support it? The answer;
remote controlled devices to enable service activation without the dispatch and
without waiting .
Service Provider Tactics to Ensure SLA-backed Ethernet Services
17 • SPIREnT WhITE PAPER
PERfORMAnCE MAnAgEMEnT
As shown in Table 3, there are specific requirements for network availability,
packet loss, and latency, and resulting penalties if the provider does not meet
them . how does a service provider ensure that the quality being provided to
their enterprise customers or service provider business partners is correct? What
happens if the customer or business partner insists that the quality does not
meet the SLA?
Unlike TDM services in which the network elements gather statistics on quality
and availability, in an Ethernet service the network elements provide minimum
visibility to the quality of the service . Some devices may track raw frame counts
or perhaps monitor a heartbeat between endpoints, but minimum visibility to
the quality of the service is available, and that only at the most basic level .
There is no native tracking of QoS for one level of service, much less several .
Because Ethernet devices do not track this information, additional tools are
required to fill the gap .
Tracking KPis
Delivering on an SLA requires confirming 24 hours a day, 7 days a week that
specific, contracted KPI targets are being met continuously . Performance
matters and is quickly becoming the differentiator for competitive Ethernet
services . The offered SLA can set a new provider apart, but it must be backed
up, a fundamental change from residential service that affects operations .
The focus has shifted from network QoS to service quality or Quality of
Experience (QoE) .
Network Availability
how would a service provider know—before the customer calls—that the
network has become unavailable? for Ethernet, an outage is the duration of
time when a service cannot send or receive packets at the various frame/packet
sizes . In the past, SLAs specified only the ability to send or receive packets, but
today’s applications have moved far beyond best effort service .
SLA KPIs are evolving as availability is applied to the multiple CoS levels
dictated by the customer applications, requiring even more visibility into
performance for each CoS . This visibility requires PM tools that can be
customized and track specific service requirements across the system .
Packet Loss
how do you confirm that traffic transits the network without dropping packets?
has a traffic misconfiguration reduced the CIR the service can transport, or is
some portion of the network now experiencing degradation, causing frames
to be damaged and thus dropped by the receiving switch? Some applications,
such as web applications, perform well in the presence of frame loss, where
other applications, such as video conferencing or live medical videos, are more
sensitive to loss .
Service Provider Tactics to Ensure SLA-backed Ethernet Services
SPIREnT WhITE PAPER • 18
Latency
how do operations confirm that traffic transits the network without exceeding
the latency requirement? Is congestion in some portion of the network causing
packets to be queued and thus latency to increase? financial transactions are
particularly affected by even a few milliseconds of latency and can have a huge
monetary impact on customers .
Active Monitoring
Passive monitoring, which tracks KPIs based on monitoring customer traffic,
places the operator at a significant disadvantage when SLA violations occur,
because by the time the operator discovers that an issue exists using this
technique, the customer is already experiencing the degradation, first hand .
Moreover, network problems that surface during times where there is no
customer traffic on the network remain undetected . Passive monitoring does not
provide an assessment of delay or jitter performance and loss is merely inferred
from frame counters instead of measured directly .
To assure SLA compliance, an MSO must measure KPIs 24 x 7 per class of
service through active monitoring, which generates test traffic designed
specifically to report on KPIs across the network .
Performance Monitoring is not a field service tech activity . hand held testers
are not capable of active monitoring because they provide only a snapshot of
the state of the network, not a continuous monitoring with historical archives .
Besides functionality issues, the multiple coordinated dispatches required to
test a service with a handheld increases OpEx and reduces MTTR . In addition,
provider edge network elements are not optimized to aggregate KPIs per class of
service per EVC .
Service Provider Tactics to Ensure SLA-backed Ethernet Services
19 • SPIREnT WhITE PAPER
Figure 10. Example of performance monitoring crossing thresholds
PM as a Management Tool
Active PM allows you to manage the network, proactively addressing issues
before they result in SLA violations . Setting the thresholds and alerting
operations of declining performance before it impairs customer traffic can result
in significant
savings by
avoiding penalties
and credits .
A threshold
violation can
trigger an alarm,
requiring a
technician to
run further tests
or to alert the
subscriber of the
situation .
To perform the active monitoring required to avoid SLA violations and penalties,
the nOC must depend on a 24 x 7, always-on tool that can view the history of
the quality of the service end-to-end via real-time, active testing is required .
Figure 9. Using Active Traffic Generation to measure QoS 24 x 7 via Service Assurance probes
NTE
NTE
NTE
NTE
NTE
NTE
NTE
NTE
Service Assurance Probes
Low bandwidth traffic
Service Provider Tactics to Ensure SLA-backed Ethernet Services
SPIREnT WhITE PAPER • 20
PM as Triage
Active PM can also save OpEx by allowing an MSO to triage performance
degradation before dispatching a field technician . for example, suppose a
customer has a problem sending traffic from office A to office B . Many service
providers would do a coordinated dispatch, sending a field technician to both
ends to run tests . With an active PM tool, a single technician at the nOC can
instead use segmentation to locate the problem, verify that it still exists, and
direct the dispatch to the needed office with the knowledge, parts and tools to
solve the problem in a single visit, all with historic data and a clear action plan .
historical information can highlight repeats and chronic outages that may point
to a specific source for the outage . Performance monitoring pulls double duty,
both as a monitoring tool and as an active testing tool .
Maintenance
Each of the SLAs in Table 1 has a requirement for a 24 x 7 call center to support
trouble calls and to repair the service within the MTTR threshold . The residential
model for working on service troubles between 8:00 a .m . and 5:00 p .m . is
not viable, and certainly isn’t competitive . Even with people on call 24 x 7, a
technician must respond within the window to avoid the penalty . The nature
of the customer’s business depends on service availability and could mean
lost revenue . Meeting MTTR metrics increases the pressure and the stakes .
In addition to proactive monitoring, a process that assures SLA compliance is
required to address reactive maintenance . The MTTR is running, even at 3:00
a .m . Where are the field technicians sent? how many are needed? figure 11
shows the typical business process for a major outage for a business customer—
send field technicians to find the problem .
Figure 11. Dispatch field technicians to find the fault
Service Provider Tactics to Ensure SLA-backed Ethernet Services
21 • SPIREnT WhITE PAPER
A service assurance solution allows the service provider to perform a rapid
remote test from the nOC, segment the link, and identify trouble before
dispatching a technician . With the right tools and processes in place, in many
cases t problems can be resolved without ever dispatching a field technician .
hard failures, in particular, are easy to discover and resolve . Intermittent
problems are the source of the time drain—random symptoms, complaints of
slow service . The 80/20 rule applies—most issues are addressed rapidly, leaving
the technicians available to address the 20 percent that require more time .
Operationally, if a technician must be dispatched to fix the problem it is critical
that the technician is sent to the correct location the first time .
In the SLA environment, every customer call, regardless of how trivial it may
seem, must be addressed with documentable action . Achieving that goal
requires a solution that provides:
• A remote controlled test and diagnostic capability to rapidly identify
troubles on the enterprise service
• The ability to isolate troubles rapidly across the network, thus
dispatching a technician to fix a problem instead of being dispatched to
find the problem
• The increase in the aggregate expertise level of technicians on duty
through automated scripts and connectors, allowing even junior
technicians to troubleshoot effectively
• Archiving historical records and documentation of network health, test
results, previous repair efforts, and PM results for analysis
Service Provider Tactics to Ensure SLA-backed Ethernet Services
SPIREnT WhITE PAPER • 22
Figure 12. Remote controlled service assurance probes working with the network elements replace field
technicians for trouble shooting
NTE with
Loopback
NTE with
Loopback
NTE with
Loopback
NTE with
Loopback
NTE with
Loopback
NTE with
Loopback
Service Assurance Probes
Field tech dispatched
figure 12 below shows how remote controlled service assurance probes working
with the network elements in the network and at the enterprise can identify
and sectionalize the problem . Therefore, the service provider must send a field
technician to the right place to fix the problem, only one instance, not to find it .
Landing high-revenue customers is a high-stakes business . With any network
or service, the emergence of problems is inevitable . The competitive nature of
maintenance favors the provider that can resolve the problem quickly to avoid or
minimize disruption to the subscriber .
Service Provider Tactics to Ensure SLA-backed Ethernet Services
23 • SPIREnT WhITE PAPER
forward-looking service providers will develop the ability to monitor managed
services and specific applications and provide tiered services . SLAs that provide
QoS metrics for each CoS are required . As more enterprises move their systems
to cloud-based applications to reduce their IT costs, more stress will be placed
on the service provider to deliver a higher quality network .
To win in the evolving highly-competitive business service market, it is critical
for service providers to offer strongly competitive SLAs . Providers must deploy
a service assurance strategy focused on remote probes for service turn-up,
performance monitoring, and troubleshooting, and optimize the use of field
technicians to reduce operational costs and deliver SLA compliance . Using
this strategy, providers can gain market share, retain customers, and exceed
revenue expectations in the Ethernet market .
COnCLUSIOn
The opportunities are big, the stakes are high, and those who succeed will learn
to navigate the competitive world of Ethernet SLAs .
This requires the MSO to optimize the process of managing service to KPI targets
by translating SLA requirements into operational actions as shown in figure 13 .
Figure 13. Win at the competitive world of Service Level Agreements
• Probes in the network act like technicians
• NTE devices with assurance loopback
• Avoid dispatches, turn up right … the first time
PERFORMANCE
MONITORING
• Active traffic to measure KPIs
• 24x7 “always on” quality
• Deep visibility to SLA-based KPIs
TROUBLE
MANAGEMENT
• Remote test probes
• Identify and sectionalize
• Dispatch to fix—not find
SERVICE
ACTIVATION