NetworkMonitoring PDF
NetworkMonitoring PDF
SLA Monitoring
Hyo-Jin Lee, Myung-Sup Kim and James W. Hong*, Gil-Haeng Lee**
*Distributed Processing & Network Management Lab.
Dept. of Computer Science and Engineering, POSTECH, Pohang Korea
**Network Technology Research Lab., ETRI, Daejeon Korea
({really97, mount, jwkhong}@postech.ac.kr , [email protected])
Abstract
Service Level Agreement (SLA) is a formal negotiated agreement between a service provider
and a customer. The service level management (SLM) is the integrated management of all
functionalities in the SLA life cycle. When a customer orders a service from a service provider, an
SLA is negotiated and then a contract is made. In the SLA contract, QoS parameters that specify the
quality level of service that the service provider will guarantee are included. The service provider
must perform SLA monitoring to verify whether the offered service is meeting the QoS parameters
specified in the SLA. SLA monitoring involves monitoring the performance status of the offered
service and provide relevant information to the service level management system. In order for the
service level management system to verify whether the specified QoS parameters are being met, the
system must gather performance data from the underlying network performance monitoring system
and map such data to the QoS parameters. In this paper, we propose a formal mapping mechanism
between QoS parameters in SLA and the network performance metrics. Although we focus on the
network access service (e.g., leased-line service, xDSL service, VPN service) in this paper, we
believe that our mapping mechanism can be easily used in SLA monitoring of other services (such
as application, server hosting, contents). We also propose a general SLA monitoring system
architecture that can be used to monitor service levels for various services offered by network,
Internet and application service providers. Finally we present how our SLA monitoring system
architecture can be used for SLA monitoring of IP backbone network service.
Keywords: SLA, Service Level Management, SLA monitoring, QoS Parameters, Network
Performance Metrics, Measurement Mapping, Evaluation Mapping.
Output
Service Level Management
SLA Monitoring
SLA creation SLA assessment
Customer
SLM is the integrated method to manage various SLA monitoring, which is important to assure the degree
SLAs from creation to assessment. We categorize the of QoS parameters and to use the monitoring results in
SLM into seven functions: SLA creation, negotiation, reporting and assessment. In other words, this paper
provisioning, monitoring, maintenance, reporting and presents a new concept of SLA monitoring, including
assessment, as illustrated in Figure 1. SLA creation three processes of Telecom Operations Map (TOM) [6],
creates an SLA templete for specified services. SLA which are network data management process at the
negotiation is the process of selecting applicable QoS network management layer, service problem
parameters in SLA and negotiating the penalty in case of management, and service quality management at the
SLA violation. SLA provisioning means that service service management layer. There are two types of input
providers configure the network element or topology to in SLA monitoring, as illustrated in Figure 2. One is
provide the service. After provisioning, service providers various QoS parameters according to services and the
must verify the degree of SLA assurance, which they other is SLA contract for each customer. After SLA
contracted with customers. To perform surveillance on monitoring uses network monitoring system, the output
QoS parameter degradation or violation, SLA monitoring of SLA monitoring is also dual. One is problem
is needed. When a violation of a QoS parameter is notification and the other is performance data.
Connectivity
Availability
Availability Functionality
MTBF Capacity unavailable time to the above formula, we can verify the
MTRS Utilization Bandwidth
Throughput Availability of QoS parameter.
Evaluation Mapping Measurement mapping and evaluation mapping are
Availability of device unfunctional time + disconnected time
QoS Parameter = 100 − ×100 (%) essential for SLA monitoring. These mappings are highly
total monitored time
First, we explain measurement mapping in network NPMs for a QoS parameter are decided, the value of
service. Measurement mapping is the process of NPMs is measured by using some network monitoring
determining some NPMs to a QoS parameter. methods. Afterwards, we can apply the measured value
Availability of QoS parameter is mapped to Availability of NPMs to the evaluation function from evaluation
of NPM. Availability of QoS parameter means the mapping. After obtaining the value, we can assess a QoS
work well or not. Delivery of QoS parameter is mapped We present the generic mapping between QoS
to Loss or Delay of NPM, but this depends on the parameters and NPMs, as illustrated in Figure 6. We
decision of service providers. Latency of QoS parameter formalize the mapping using the theory of set and
means Delay of NPM: one-way delay, round trip delay functions. Through this formalization, we make clear the
and delay variance. Bandwidth of QoS parameter is concept of QoS parameters to NPMs mapping, which is a
mandatory process in the construction of SLA verify a QoS parameter from measured NPM values.
monitoring system. Evaluation function, which is element of set E, can be
N
First, we define the three sets : Q, N and E. Set Q different for each element of Set 2 . However, it is
represents a set of QoS parameters, set N represents a set difficult to define the evaluation function according to
of NPMs and set E is the set of evaluation functions. QoS parameters, and it also depends on the type of
N
Set 2 is the power set of N. The relationship between service.
N
set Q and set 2 is the measurement mapping: m(x), In this paper, m(x) and n(x) are defined as the QoS
N
and the relationship between set 2 and set E is the parameters to NPMs Mapping. Therefore, to evaluate
evaluation mapping: n(x). QoS parameters in SLA monitoring, it should be decided
Monitoring module
agent (sa), traffic agent (ta) and network agent (na)) [13].
Figure 7. SLA Monitoring System Architecture
The evaluation function should be determined for each
QoS parameter from network performance values
Customer SLA module in the left side of Figure 7
gathered by SLA agents. One should also determine the
stores the information of each customer contract. This
stored data format and the period to be kept in storage.
module should be designed with good flexibility because
The communication data format among each functional
the amount of stored information can vary easily
module is designed to avoid high network traffic, and is
according to the popularity of the service. The LDAP
another critical factor to be considered.
[22] is one possible implementaion solution for the
Customer SLA module.
5. SLA Monitoring System Architecture
The center of Figure 7 shows the Monitoring
In this section, we represent an SLA monitoring
module which is designed in a layered architecture.
system architecture as illustrated in Figure 7. This
There can be multiple SLA agents for one service,
architecture has three main components: the Parameter
because there can be multiple QoS parameters for one
service and multiple NPMs can be mapped to one QoS Therefore, we decided on the QoS parameters for the
parameter. Also, in some cases, multiple SLA agents IP-backbone network service with the following three:
should be deployed at a regionally dispersed place. The Availability, Latency and Loss. The content of SLA
Performance Metric (PM) Collector gathers the negotiation may be as follows: The backbone availability
performance data measured at each SLA agent and sends over 99.99% should be assured. The average round trip
these to the performance analyzer. The quality evaluation time (RTT) should be less than 50 ms. The delivery ratio
for each QoS parameter and each customer SLA occurs should be more than 98.0%. From our definition of QoS
in the performance analyzer. The performance analyzer parameters to NPMs mapping, there should be a decision
evaluates the quality using the evaluation function stored about m(x) and n(x).
in the Map DB and customer contract information in the
Customer SLA DB. The analyzed information is stored Table 2. Measurement Mapping for IP-Backbone
Network Service
at the Data Store module for a certain time period. The
QoS NPM method point period type
stored data format and stored period is an important point Parameter
Availability Functionality active edge 5 min na
to consider. If a problem such as SLA violation is router
Connectivity active edge 5 min ta
detected, the Performance Analyzer sends an alarm router
Latency RTT active edge 5 min ta
message to the Problem Manager, which detects this router
failure or problem and sends an alarm message to an Delivery RT loss active edge 5 min ta
router
administrator or SLA maintenance system immediately.
The Reporting Manager is to create a report for the
First, we describe about m(x) which is illustrated in
billing system and the reporting system from the Data
Table 2. The QoS parameter Availability is mapped to
Store.
two NPMs: functionality and connectivity, which are
monitored at each edge router and every five minutes
6. SLA Monitoring for IP-backbone Network using an active monitoring method. The network agent
Service
(na) is used to acquire availability NPM value and the
traffic agent (ta) is used for connectivity NPM value. In
To validate our proposed architecture, we applied it
the same manner, Latency is mapped to RTT and
to an IP-backbone network service and designed an SLA
Delivery is mapped to round trip (RT) loss. By this m(x)
monitoring system. The service provider of IP-backbone
mapping the monitoring method, point, period and agent
network service belongs to NSP among our previous
type are decided. For the IP-backbone network to
categorization such as KT, NTT and AT&T. The
monitor the given NPMs, we should set up SLA agents at
customer of this service can be any user who is
each edge router which generates test packet every five
connected to this IP-backbone network. These can be
minutes and sends the results to the upper layer PM
companies, organizations, other ISPs using leased line
Collectors.
service, and individuals who are using xDSL service.
The QoS parameters in these service should be decided
with the terms to represent whether the backbone
network is in a healthy state or not.
Table 3. Evaluation Mapping for IP-Backbone amount of data is smaller than that generated by passive
Network Service traffic monitoring. In addition, the Parameter Mapper,
QoS Evaluation Function Map DB and Customer SLA DB which receive data
Parameter
Availability 100 − device unfunctional time + disconnected time ×100 (%) from extraneous systems are located at the same machine.
total monitoredtime
Latency ∑ RTT (m sec)
total number of RTT test packet 7. Conclusion and Future Work
Delivery number of lost packet
100 − × 100 (%) In this paper, we proposed a new concept of SLA
total number of test packet
monitoring: a QoS parameters to NPMs mapping and a
Next, the n(x) mapping is used to determine the generic architecture for the SLA monitoring system. The
evaluation functions to measure the quality of each QoS QoS parameter to NPM mapping combines measurement
parameter, as described in Table 3. As mentioned mapping and evaluation mapping. From this mapping
previously, these evaluation functions are stored in the one can decide which type of SLA agents are needed to
Map DB and are used by the Performance Analyzer. evaluate the quality of the provided service in an easy
and systematic way. We also provided a mapping
Parameter Mapper.
guideline for network services. In the network service,
Map DB.
Customer Business Customer SLA. the QoS parameters can be Availability, Delivery,
System,
Service Planning Latency, Bandwidth, MTRS and MTBF. We also
and Development Performance Analyzer.
System Data store. provided the points of consideration in the design of the
Problem Manager.
SLA Maintenance Reporting Manager.
System, SLA monitoring system and presented its generic
Reporting System, PM Collector.
Billing System
Backbone router.
architecture. We believe that this paper can serve as a
SLA agent.
- loss check guideline for service providers who intend to deploy an
- RTT check
backbone network
- connectivity check SLA-based network service and set up an SLA
- functionality check
data flow
monitoring system.
Figure 8. Layered and Distributed SLA monitoring We plan to complete our validation with the
System in IP Backbone Network Service
implementation of the SLA monitoring system we
designed for the IP-backbone network service. We intend
Figure 8 shows the SLA monitoring system we
to make an overall guideline of the QoS parameter to the
designed for the IP-backbone network service. The SLA
performance metric for various services such as hosting,
agents which are deployed at every edge router measure
application and content services. A refinement of the
network performance metrics using active method and
proposed SLA monitoring system architecture is also
send the measured NPM values to the PM Collectors.
needed to apply to various services.
There can be two or more PM Collectors because the
backbone edge routers are located regionally far away
from each other. It is necessary to deploy multiple PM
References
Collectors to reduce the monitoring traffic.
The Performance Analyzer, Data Store, Problem [1] TM Forum, "Performance Reporting Concepts and
Manager and Reporting Manager can coexist at the Definitions," TMF701 v2.0, Nov., 2001.
same machine because in this kind of monitoring system [2] TM Forum, "Service Level Agreement Management
Handbook," GB917 v1.5, Jun., 2001. Delay Metric for IPPM," IETF RFC2679, Sep., 1999.
[3] T. Choi, S. Yoon, H. Chung, C. Kim, J. Park, B. Lee, [16] G. Almes, S. Kalidindi, M. Zekauskas, "A One-way
T. Chung, "Wise: Traffic Engineering Server for A Packet Loss Metric for IPPM," IETF RFC2680, Sep.,
Large-scale MPLS-based IP Network," Proc. of 1999.
NOMS 2002, Florence, Italy, Apr., 2002, pp. 251-264. [17] G. Almes, S. Kalidindi, M. Zekauskas, "A Round-
[4] S. Blake, et. al, "An Architecture for Differentiated trip Delay Metric for IPPM," IETF RFC2681, Sep.,
Services," IETF RFC2475, Dec., 1998. 1999.
[5] M. Alves, et. al, "New Measurement with the RIPE [18] K. Appleby, et. al. , "Oceano - SLA based
NCC Test Traffic Measurement Setup," Proc. of PAM management of a computing utility," Proc. of the 7th
2002 Workshop, Colorado, USA, 2002. IFIP/IEEE International Symposium on Integrated
[6] TM Forum, "Telecom Operation Map," GB910, v2.1, Network Management, Seattle, WA, USA, May 2001,
Mar., 2000. pp. 855 -868.
[7] CAIDA, "Network Measurement FAQ," Jan 17, 2002, [19] CISCO white paper, "Successful Implementation
http://www.caida.org/outreach/metricswg/faq.xml. Strategies for Service-level Managment," CISCO,
[8] IPPM, http://www.advanced.org/IPPM/. 2000.
[9] Thomas Lindh, “An Architecture for Embedded [20] P. Bhoj, et. al, "SLA management in federated
Monitoring of QoS Parameters in IP-Based Virtual environments," Proc. of IM’99, Boston, MA, USA,
Private Networks,” Proc. of PAM2001 Workshop, May 1999, pp. 293-308.
Amsterdam, Apr., 2001. [21] A. Feldman, et. al, "NetScope: Traffic Engineering
[10] T. Lindh, "A New Approach to Performance for IP Networks," IEEE Network, Vol. 14, No. 2,
Monitoring in IP Networks-Combining Active and Mar./Apr. 2000, pp. 11-19.
Passive Methods," Proc. of PAM 2002 Workshop, [22] W. Yeong, T. Howes, S. Kille, "Lightweight
Colorado, USA, 2002. Directory Access Protocol", IETF RFC1777, Mar.
[11] S. H. Hong, J. Y. Kim, B. R. Cho, J. W. Hong, 1995.
"Distributed Network Traffic Monitoring and Analysis [23] M. Murray, K. Claffy, “Measuring the
using Load Balancing Technology," Proc. of Immeasurable: Global Internet Measurement
APNOMS 2001, Sydney, Australia, Sep., 2001, pp. Infrastructure”, Proc. of PAM2001, Amsterdam, Apr.,
172-183. 2001.
[12] S. Waldbusser, "Remote Network Monitoring [24] RIPE NCC Test Traffic Measurements,
Management Information Base," IETF RFC1757, http://www.ripe.net/test-traffic/index.html.
Feb.,1995. [25] NIMI, http://www.ncne.nlanr.net/nimi/.
[13] L. Lewis., P Ray.,"On the migration from enterprise [26] Surveyor, http://www.advanced.org/csg-ippm/.
management to integrated service level management," [27] NLANR AMP, http://moat.nlanr.net/AMP.
IEEE Networks , Vol. 6, No.1, Jan., 2002, pp 8-14. [28] PingER, http://www-iepm.slac.stanford.edu/pinger/.
[14] J.Mahdavi, V. Paxson, "IPPM Metrics for [29] Skitter,
Measuring Connectivity," IETF RFC2678, Sep.,1999. http://www.caida.org/tools/measurement/skitter/.
[15] G. Almes, S. Kalidindi, M. Zekauskas, "A One-way [30] CoralReef,
http://anala.caida.org/CoralReef/Demos/cerfnet/link. 1992 ~ 1995 Univ. of Western Ontario, 연구교수
[31] WAND, http://wand.cs.waikato.ac.nz/ 1995 ~ 현재 포항공과대학교 컴퓨터공학과 부교수
[32] NLANR PMA, http:// moat.nlanr.net/PMA. <관심분야> 분산처리, 네트워크 트래픽 모니터링,
[33] Internet2, http:// monon.uits.iupui.edu. 네트워크 및 분산 시스템 관리, CORBA, Internet
[34] MAWI, http:// tracer.csl.sony.co.jp/mawi. 관리
이길행
1986 KAIST 전산학과 석사
1996 KAIST 전산학과 박사
이효진 1986 ~ 현재 한국전자통신연구원
2001 동국대학교, 컴퓨터공학과 네트워크연구소 서비스네트워크
학사 연구부 SLA 관리체계연구팀장/책임연구원
2001 ~ 현재 포항공과대학교, 컴 <관심분야> 분산처리, 로드 밸런싱, 네트워크 관
퓨터공학과 석사 과정 리, DBMS
<관심분야> SLA 모니터링, 네트워크 모니터링,
XML 기반의 네트워크 관리
김명섭
1998 포항공과대학교, 전자계산
학과 학사
2000 포항공과대학교, 컴퓨터공
학과 석사
2000 ~ 현재 포항공과대학교, 컴퓨터공학과 박사
과정
<관심분야> 분산처리, 네트워크 트래픽 모니터링,
네트워크 및 분산 시스템 관리
홍원기
1983 Univ. of Western Ontario, 전
산학 학사
1985 Univ. of Western Ontario, 전
산학 석사
1985 ~ 1986 Univ. of Western
Ontario, 전산학과 강사
1986 ~ 1991 Univ. of Waterloo, 전산학 박사
1991 ~ 1992 Univ. of Waterloo, Post-Doc fellow