Jump to content, skipping navigation

Testing Audio/Video Quality in the Home Network

    * Required Field

    Cancel

    testing audio/video Quality in the home network August 2009 Rev. A 08/09 Application Note sPirent 1325 Borregas Avenue Sunnyvale, CA 94089 USA Email: sales@spirent.com Web: http://www.spirent.com americas 1-800-SPIRENT • +1-818-676-2683 • sales@spirent.com euroPe and the middle east +44 (0) 1293 767979 • emeainfo@spirent.com asia and the Pacific +86-10-8518-2539 • salesasia@spirent.com © 2009 Spirent. All Rights Reserved. All of the company names and/or brand names and/or product names referred to in this document, in particular, the name “Spirent” and its logo device, are either registered trademarks or trademarks of Spirent plc and its subsidiaries, pending registration in accordance with relevant national laws. All other registered trademarks or trademarks are the property of their respective owners. The information contained in this document is subject to change without notice and does not represent a commitment on the part of Spirent. The information in this document is believed to be accurate and reliable; however, Spirent assumes no responsibility or liability for any errors or inaccuracies that may appear in the document. testing audio/video Quality in the home network CoNTENTS Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 Audio/Video Technologies overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2 Video Encoding and Compression. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2 Audio/Video Stream Encapsulation and Transport . . . . . . . . . . . . . . . . . . . 3 IP Transport . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 Factors Affecting Audio/Video Quality . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 Video Codec and Codec Bit Rate . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 Packet Loss . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 Packet Delay Variation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 Audio/Video Quality Metrics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7 Quality of Experience Metrics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7 Expert Analysis Metrics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7 Video Stream Metrics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8 Transport Quality Metrics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9 Testing Audio/Video Quality Methodology . . . . . . . . . . . . . . . . . . . . . . . . . . 11 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15 SPIRENT APPLICATIoN NoTE • i Testing Audio/VideoQuality in the Home Network 1 • SPIRENT APPLICATIoN NoTE 1. INTRoDUCTIoN over the past few years, advances in the technologies used to create, store and deliver audio/video content have enabled a substantial increase in the number of audio/video programming choices and services available to consumers. The efficiencies realized through sophisticated audio/video compression techniques, storage solutions and packet-based transports have been leveraged by audio/ video service providers to expand program lineups and offer consumers an array of compelling new services—high-definition television, video-on-demand, multi- room digital video recorders and more. The same technological advancements also have contributed to a competitive shift among audio/video service providers, including the entrance of many more diverse organizations into the industry. Today, traditional audio/video service providers such as cable and satellite television operators compete vigorously with non-traditional service providers such as telcos, mobile network operators and “over-the-top” Internet video providers to attract new customers and retain their current customer bases. The ability to assure the quality of the viewing experience will be one of the key factors that will allow audio/video service providers to attain an advantage in this highly competitive environment. This should include investments in service assurance equipment which covers the core, edge and access networks, and most importantly, the customer’s home network. Increasingly, the home network is becoming the responsibility of the audio/video service provider and is often the portion of the service delivery network most susceptible to quality issues. This application note describes how portable test tools may be used in the home to support the delivery of audio/video service and to guarantee a satisfactory quality of experience. First, a brief overview of the technologies used to create and deliver audio/video content is provided, including a description of the factors that may contribute to quality degradation in the home. Next, the application note discusses the key performance metrics provided by Spirent’s Tech-X Flex field test set to assess the quality of the viewing experience. Finally, the methodology for using the Tech-X Flex to measure audio/video quality, identify the root causes of quality degradation and resolve issues that adversely affect the customer viewing experience is offered. Testing Audio/Video Quality in the Home Network SPIRENT APPLICATIoN NoTE • 2 2. AUDIo/VIDEo TECHNoLogIES oVERVIEW 2.1. video encoding and compression A digital video is a series of pictures or frames with a common resolution which are displayed sequentially at a specific rate. Typically, digital video includes a significant quantity of redundant information which may be compressed to reduce the amount of capacity that is needed to transfer and store the images. Most video codecs use two types of compression—spatial compression and temporal compression. Spatial compression operates on a single frame and leverages the fact that the color and brightness of the objects included in a frame may be similar over a given area. For example, an outdoor scene may have images in which a significant portion of the blue sky has adjacent pixels that are very similar. Rather than encoding each pixel individually, spatial compression encodes only the differences between neighboring pixels. This technique reduces the amount of information needed to display the frame. By contrast, temporal compression works over multiple frames. Because a series of digital video frames typically has a large amount of redundant data (for example, background objects that are not moving), temporal compression techniques are used to encode only the differences between successive frames. This significantly reduces the quantity of data needed to display the frames, allowing the realization of substantial network and/or storage capacity savings. Common video codecs organize a series of successive digital video frames encoded with spatial and temporal compression to form a group of pictures (goP). The spatially-compressed frames in a goP are called “intra-coded” frames or “I-frames,” which contain all of the information needed to independently render a full digital video frame. The frames included in the goP encoded with temporal compression are called “inter-coded” which reflects their dependency upon other frames in the goP to be reconstructed. Most video compression techniques use two types of inter-coded frames – “P-frames” and “B-frames”. A P-frame (or “predictive coded frame”) is coded using the changes from the most recent I-frame or P-frame in the goP. A B-frame (or “bidirectional predictive coded frame”) is coded using either the changes from the previous I-frame or P-frame in the goP, the following I-frame or P-frame, or a combination of both the previous and following I-frames or P-frames. Note that a typical goP includes one I-frame followed by a variable number of P-frames and B-frames. The following diagram shows a sample goP: Testing Audio/VideoQuality in the Home Network 3 • SPIRENT APPLICATIoN NoTE 2.2. audio / video stream encapsulation and transport The methods used to encapsulate and transport compressed audio and video follow a layered model much like the oSI protocol stack. Initially, the compressed audio or video becomes a unit called an elementary stream. The elementary stream includes header information which allows the audio/video content to be organized and decoded properly. Elementary streams then are divided into smaller units called packetized elementary streams. These streams contain additional information such as the timestamps needed to decode and present the audio/video content correctly. Then, multiple packetized elementary streams are multiplexed into a unit called a transport stream which includes one or more audio/video programs. Typically, the transport stream is divided into 188-byte packets for transmission and additional headers are appended to allow the transport stream to be reconstructed by the decoder. The most important transport stream header fields include the program identifier (PID) which associates a transport stream packet with a specific program and the program clock reference (PCR) which provides the decoder with the time reference needed to present and synchronize a program’s audio/video streams. The following diagram illustrates how elementary streams are organized into programs and transport streams: Testing Audio/Video Quality in the Home Network SPIRENT APPLICATIoN NoTE • 4 A decoder needs to understand how to de-multiplex a transport stream to present a specific program to the viewer. This is accomplished using a number of tables which are included in the transport stream by the encoder. Each table forms an elementary stream which is assigned a PID. Collectively, these tables are called program specific information. The primary table is called the program association table (PAT) which identified using PID 0. Essentially, the PAT includes a list of other tables called program map tables (PMTs) included in the transport stream with their corresponding PIDs. one PMT is included in the transport stream for each audio/video program and contains a list of the elementary streams that form the respective program, along with the associated PIDs. The following tables illustrate the program specific information included in the transport stream. 2.3. iP transport Commonly, two methods are available for encapsulating and delivering audio/ video services to residential customers over IP networks. one method is placing 188-byte transport stream packets directly into UDP segments which are then encapsulated using IP. A UDP segment may carry between one and seven transport stream packets, with seven packets per segment most typically used to maximize network utilization. A second method is defined by IETF RFC 2250 which describes how transport stream packets may be encapsulated using the real-time transport protocol (RTP). Usually, transport stream packets encapsulated using RTP are transported using UDP / IP. Using RTP to encapsulate transport stream packets provides a number of operational advantages. This includes using RTP timestamps and sequence numbers for error recovery and retransmission; and, RTP also provides performance metrics that may be useful for service providers. A tradeoff exists, however, between the benefits realized from the information included in RTP headers and the additional network capacity needed to transport the headers. The following diagrams show how transport stream packets are typically encapsulated for IP transport. Testing Audio/VideoQuality in the Home Network 5 • SPIRENT APPLICATIoN NoTE 3. FACToRS AFFECTINg AUDIo/VIDEo QUALITy Audio/video streams compressed using common codecs and delivered to consumers using IP packet-based transports are quite susceptible to a number of factors that may reduce quality. generally, the factors contributing to quality degradation may be grouped into three broad categories—video codec selection and codec bit rate, packet loss, and packet delay variation. 3.1. video codec and codec Bit rate A relationship exists between the amount of compression applied to a video stream and the quality of the video, where more compression reduces quality and less compression increases quality. This relationship has a few dependencies such as the type of video codec selected, how the codec bit rate is controlled (either a constant or variable bit rate) and the resolution of the compressed images. The degradation caused by compression may include blockiness, blurring, jerkiness, pixelization or other impairments visible to the viewer. Since codec selection and bit rate typically are attributes controlled by systems engineers at the head end, degradation from codec may be differentiated from degradation caused by network transport such as packet loss or packet delay variation. 3.2. Packet loss Packet loss which affects quality may occur due to a number of factors including network congestion, reduced network capacity, and physical and logical link errors. often, packet loss occurs in “bursts” or periods during which significant numbers of packets are dropped. Techniques like forward error correction (FEC), interleaving, packet loss concealment and error recovery/retransmission commonly are applied to protect audio/video streams from packet loss. yet, occasionally packet loss is sufficient enough to exceed the capabilities of these techniques and severe quality degradation can occur. Even minimal amounts of uncorrected packet loss will affect a video stream and may be noticed by the viewer. Typically, the degradation from packet loss observed by the viewer will appear to be distorted blocks or slices of the video stream, black screens, screen “freezes”, or gaps in the audio stream. Testing Audio/Video Quality in the Home Network SPIRENT APPLICATIoN NoTE • 6 Understanding how packet loss affects a video stream requires special consideration. Recall that the video codecs used by most service providers combine a sequence of successive frames into goPs, each of which includes one intra-coded I-frame and a series of inter-coded P-frames and B-frames that represent the motion vectors from prior frames. Packet loss affecting the I-frame or a P-frame in the group of pictures will propagate through all of the following P-frames and B-frames and significant degradation can occur. Packet loss affecting a B-frame, however, will not affect the other frames included in the goP and therefore degradation may be limited or even not noticed by the viewer. Thus, the effects of packet loss on the quality of a video stream will vary depending on where the packet loss occurs within the stream. The following figure illustrates this concept. 3.3. Packet delay variation Packet delay variation often is caused by network congestion, limited network capacity or the grouping of many audio/video streams (typically multicast) into the same queues, buffers and schedulers of the network elements that audio/ video streams transit from their source towards their destinations. Packet delay variation may cause audio/video streams to overflow or underflow the de-jitter buffers of decoders which are designed to allow the audio/video streams to be presented to the viewer at a constant rate. Decoders will discard audio/video packets which arrive at the buffer too early (overflow) or too late (underflow) and quality degradation will subsequently occur. The effects of quality degradation observed by the viewer are similar to those caused by packet loss, such as distorted blocks or slices, black screens and/or “freezes.” Audio distortion or gaps in the audio may also be present. Testing Audio/VideoQuality in the Home Network 7 • SPIRENT APPLICATIoN NoTE 4. AUDIo/VIDEo QUALITy METRICS The following sections describe the metrics reported by Spirent Tech-X Flex which may be utilized to assess the quality of the audio/video streams delivered to the customer’s home. 4.1. Quality of experience metrics The Tech-X Flex uses a mean opinion score (MoS) to describe the overall quality of the audio/video streams under test. The MoS is a number on a scale between 1.0 (worst quality) and 5.0 (best quality) that represents how a person viewing the program would rate the experience. Typically, a MoS greater than 4 indicates a stream that will provide a satisfactory experience. A MoS between 3.0 and 4.0 describes stream that has suffered some quality degradation and will offer a sub-optimal viewing experience. A stream rated below 3.0 has significant quality degradation and a viewer would likely consider the viewing experience unsatisfactory. The following table includes more detailed descriptions of the MoS values reported by the Tech-X Flex. metric Description mos-v A MoS that describes the overall quality of the video stream included in the program under test. This metric considers the video codec bit rate, goP type and length and packet loss distribution. mos-a A MoS that describes the overall quality of the audio stream included in the program under test. The metric considers audio codec bit rate, audio sampling rate and packet loss distribution.. mos-av A MoS that describes the overall quality of the audio and video streams present in the program under test. This metric considers audio and video quality as well as the synchronization between the audio and video streams. The audio/video program’s MoS values may be used by technicians to identify quickly whether the quality of the viewing experience will be satisfactory and to assess whether performance issues may need to be addressed. 4.2. expert analysis metrics The Tech-X Flex offers an expert analysis of the measurements performed during audio/video quality testing. The expert analysis is designed to isolate the cause(s) of the stream degradation that may be affecting the viewing experience. The expert analysis provides three metrics which are defined in the table below. Testing Audio/Video Quality in the Home Network SPIRENT APPLICATIoN NoTE • 8 metric Description degradation from loss The percentage of overall quality degradation caused by packet loss. degradation from Jitter The percentage of overall quality degradation that may be attributed to de-jitter buffer packet discards (overflows or underflows). degradation from codec The percentage of overall quality degradation caused by stream com- pression. Some percentage of degradation from the codec is always displayed by the Tech-X Flex since the analysis considers the quality of the original uncompressed audio/video streams. The amount of degradation caused by codec will depend upon the codec type as well as the bit rate of the compressed streams. The information offered by the Tech-X Flex expert analysis can assist with differentiating between the causes of degradation and allow technicians to isolate issues more effectively. For example, degradation from jitter commonly is caused by network congestion. Significant congestion will introduce sufficient delay between audio/video packets and will force the decoder’s de-jitter buffers to either overflow or underflow and therefore discard packets. often, congestion occurs upstream from a customer’s residence, at a customer’s residential gateway or perhaps within the residential network due to competition with other services and applications for network capacity. Quality degradation from packet loss may occur due to network congestion as well; however, other sources of packet loss may be present including link errors and other transmission or physical media issues. Determining the cause of audio/video degradation from packet loss will require a more thorough investigation that addresses all of these factors. 4.3. video stream metrics The Tech-X Flex reports several metrics that describe the specific attributes of the video stream under test, the video stream’s bit rate and the correlation between transport stream packet loss / discards and the video frames affected. The following table provides descriptions of these metrics. Testing Audio/VideoQuality in the Home Network 9 • SPIRENT APPLICATIoN NoTE metric Description image size The horizontal x vertical resolution of the video stream under test expressed using pixels. image type A short description of the video frame resolution. This may be SDTV (standard-definition television) or HDTV (high-definition television). codec type The codec used to compress the video stream under test. Examples include MPEg-2 and H.264. goP type A description of the types of frames included in the goP and the way the frames are organized, for example: IBBPBB… goP length The total number of I, P and B frames included in a goP by the video stream’s encoder. frames received The total number of frames received during the measurement, organized by type – I-Frames, P-Frames and B-Frames. Packets lost The total number of MPEg-2 transport stream packets lost during the measurement by frame type. Packets discarded The total number of MPEg-2 transport stream packets discarded during the measurement by frame type. frames impaired The total number of frames by type impaired during the measure- ment due to transport stream packet loss and/or discards. receive rate The average rate at which the video stream is received by the test set expressed using kilobits per second. Peak receive rate The peak rate at which the video stream is received by the test set expressed using kilobits per second. Note that the Tech-X Flex video stream image, codec, goP and frame metrics provided may be used to optimize codec selection and the goP type and length selected for the video stream. 4.4. transport Quality metrics The Tech-X Flex provides several metrics that allow technicians to assess the quality of the network transporting the stream under test. generally, these metrics characterize the packet loss and packet delay variation added to the stream while transiting from the source to the measurement point. The following table provides further description of these metrics. Testing Audio/Video Quality in the Home Network SPIRENT APPLICATIoN NoTE • 10 metric Description Packets lost The total number of transport stream packets lost during the mea- surement period. Packets discarded The total number of transport stream packets discarded by the de- jitter buffer emulator during the measurement period due to over- flows and/or underflows. Packets out-of-sequence The total number of transport stream packets received out-of-se- quence during the measurement period. gap Period loss rate The average packet loss rate measured during gap loss periods. gap Period loss length The average packet loss length measured during gap loss periods expressed in milliseconds. Burst Period loss rate The average packet loss rate measured during burst loss periods. Burst Period loss length The average packet loss length measured during burst loss peri- ods expressed in milliseconds. maPdv The mean absolute packet delay variation. This is the delay varia- tion experienced by stream packets relative to the short term running average of the delay calculated for prior packets. PPdv The packet-to-packet delay variation. This is the statistical vari- ance of the delay experienced by consecutive stream packets over the measurement duration. The packets lost/discarded metrics, as well as the gap/burst loss metrics, allow technicians to more accurately qualify the “burstiness” of packet loss which directly corresponds with the viewer’s quality experience. The reported packet delay variation metrics may be compared to the de-jitter buffer sizes supported by decoders to permit technicians to estimate whether or not overflows or underflows will occur. Additionally, the packet loss and delay variation metrics may be correlated with service level specifications established for the customer to determine whether or not satisfactory quality of service is provided. The Tech-X Flex also reports the media delivery index (MDI) which is a set of metrics that describe the quality of the network used to transport the audio/video streams. MDI includes two measurements—media loss rate (MLR) and delay factor (DF). The MLR is the number of transport stream packets lost per second. The DF describes the variation of the bit rate of the transport stream under test relative to a constant bit rate and calculates the amount of de-jitter buffer (or time) that will be needed to accommodate the bit rate variation and avoid buffer overflow and underflow. Essentially, the DF estimates the size of the de-jitter buffer required to assure satisfactory audio/video quality. The following table provides descriptions of the MDI metrics provided by the Tech-X Flex. Testing Audio/VideoQuality in the Home Network 11 • SPIRENT APPLICATIoN NoTE metric Description mlr The number of transport stream packets lost per second. average df The average delay factor calculated over the measurement duration. minimum df The minimum delay factor calculated over the measurement duration. maximum df The maximum delay factor calculated over the measurement duration. receive rate The actual rate of the transport stream under test expressed us- ing kilobits/second. 5. TESTINg AUDIo/VIDEo QUALITy METHoDoLogy Testing the quality of a residential audio/video service commonly occurs throughout the service lifecycle, including pre-qualification to determine whether or not the service may be offered, following equipment installation and service provisioning, after equipment and/or service upgrades and while responding to troubles which affect the service quality. The following information describes the methodology for using the Tech-X Flex to measure and verify the health of an audio/video service and generally applies to each of these situations. The Tech-X Flex operates using two modes—host mode and passive mode. Host mode allows the Tech-X Flex to emulate the customer’s set top box (STB) to validate audio/video quality. Typically, host mode testing is used to assess audio/ video streams transported using IP multicast. The following diagram illustrates how the Tech-X Flex is connected to the residential network to conduct testing using host mode. Testing Audio/Video Quality in the Home Network SPIRENT APPLICATIoN NoTE • 12 Testing using passive mode uses two Tech-X Flex interfaces which allows the Tech-X Flex to be connected “in-line” with the audio/video stream on the residential network. This configuration allows the stream to be tested while simultaneously being viewed on and controlled by customer equipment. often, passive mode testing is used to measure streams that are transported using IP unicast such as video-on-demand (VoD) or multi-room DVR programs. The following diagram shows how the Tech-X Flex is attached to the home network using passive mode. After determining whether host mode or passive mode should be used, multiple measurement points targeting each segment of the home network may be tested to qualify the service and equipment or locate specific troubles. The following diagram illustrates different possible measurement points. Testing Audio/VideoQuality in the Home Network 13 • SPIRENT APPLICATIoN NoTE Typically, the first measurement point should be “upstream” from the residential gateway to evaluate the quality of the access network link (e.g. DSL, MoCA) and to determine audio/video quality before the stream transits the home network. The MoS values (MoS-V, MoS-A, MoS-AV) reported by the Tech-X Flex may be used to quickly verify quality. Remember that MoS values greater than 4 typically mean that quality is satisfactory at the measurement point, otherwise quality issues may exist. The following image illustrates how MoS values are reported by the Tech-X Flex. Multiple quality measurement reports with MoS values below 4 demonstrate that the quality of the streams and/or network upstream from the home will not provide satisfactory service for the customer. In this case, a technician may refer the issue to the appropriate operations center. otherwise, a technician may proceed to test directly downstream from the residential gateway, shown as measurement point 2 in the previous diagram. The same testing procedure may be followed using the same MoS criteria, with unsatisfactory MoS values leading to the investigation of the residential gateway and related interfaces and links. otherwise, if the residential gateway is performing properly, a technician may continue to validate quality throughout the home network by testing at STBs, shown in the previous diagram as measurement point 3. This allows the technician to verify the links between the residential gateway and the STBs and to differentiate between issues occurring on the residential network and problems associated with the customer’s equipment. Testing Audio/Video Quality in the Home Network SPIRENT APPLICATIoN NoTE • 14 After audio/video quality measurements which report unsatisfactory MoS at any measurement point within the home network, a technician may leverage the additional metrics available with the Tech-X Flex to further identify and resolve service issues. The technician should start this process by determining the factor(s) that are causing the quality degradation such as video codec, packet loss and/or packet delay variation. The Tech-X Flex expert analysis, displayed in the following image, summarizes this information. The technician then may pursue the root cause of the issue, perhaps network congestion or reduced network capacity causing degradation from jitter (packet delay variation), or possibly physical/link layer issues causing degradation from loss. The additional packet delay variation and packet loss metrics provided by the Tech-X Flex may be used to fully understand the characteristics of the quality degradation and troubleshoot the issue more completely. Testing Audio/VideoQuality in the Home Network 15 • SPIRENT APPLICATIoN NoTE 6. SUMMARy Competition between audio/video service providers has escalated significantly over the past few years and the ability to assess the home viewing experience can be a key factor when attracting and retaining customers. This effort requires advanced test tools that accurately assess the quality of the audio/ video at the portion of the service delivery network most at risk—the customer’s home. Spirent’s Tech-X Flex in-home field tester has a complete audio/video analysis solution which may be used throughout the lifecycle of the audio/video service to guarantee the quality of experience and allow providers to install equipment, provision services and troubleshoot and resolve quality issues more efficiently. Additional information about Spirent’s Tech-X Flex is available at www.spirent.com or by contacting your Spirent account manager