Table of Contents
Abstract
This chapter shall provide technical background information about protocols and components used in IP telephony. It introduces involved component types, gives detailed information about H.323 and SIP and RTP as well as some notices about Media Gateway Control and vendor specific protocols.
An IP telephony infrastructure usually consists of different types of components. This section shall give an overview about typical components without describing them in a protocol specific context.
A terminal is an communication endpoint that terminates calls and their media streams. Most commonly this is a hard- or software telephone or videophone, eventually enhanced with data capabilities.
There are terminals that are intended for user interaction and those which are automated - like answering machines.
An IP telephony terminal is located on at least one IP address. There may well be multiple terminals on the same IP address but they are treated independently.
Most of the time a terminal has assigned one or more addresses (see Section , “Addressing”). In case that IP telephony servers are used a terminal registers this addresses with its server.
To place an IP telephony call it requires at least two terminals - and the knowledge of the IP address and port number of the terminal to call. Obviously such a situation is not very satisfying for you don't want to remember IP addresses to call persons and you want to be able to call targets on hosts that use DHCP.
As mentioned before terminals usually register their addresses with a server component. The server stores this telephone addresses along with the IP addresses and thus is enabled to map a telephone address to a host.
When a telephone user dials an address the server tries to resolve the given address into a network address. To do so the server may interact with other telephony servers or services. It may also provide further call routing mechanisms like CPL scripts or skill-based routing.
Finally a telephony server is responsible to authenticate registrations, authorize caller and to do the accounting
Gateways are telephony endpoints that allow calls between endpoints that usually wouldn't interoperate. Usually this means that a gateway translates one signaling protocol into another (e.g. SIP/ISDN Signaling gateways), but translating between different network addresses (IPv4/IPv6) or codecs (Media Gateways) can be called gatewaying as well. It is of course possible that all functionality exists in a single gateway.
Finding gateways between VoIP and a traditional PBX is usually quite simple. Gateways that translate different VoIP protocols are harder to find. Most of them are limited to basic call functionality.
Conference Bridges provide means of having 3- or multipoint conferences that can be either ad-hoc or scheduled. Because of the high resource requirements conference bridges are usually dedicated servers with special media hardware.
A user willing to use a communication service needs an identifier to describe itself and the called party. Ideally, such an identifier should be independent of the user physical location. The network should be then responsible for finding the current location of the callee. One party may define to be reached by multiple contacts.
Regular telephony systems use E.164 numbers - the international public telecommunication numbering plan. An indentifier is composed of up to 15 digits with a leading plus sign, for example +1234565789123. When dialing, the leading plus is normally replaced by the international access code, usually double zero (00). This is followed by a country code and a subscriber number.
First IP telephony systems used IP addresses of end-point devices as user identifiers. Sometimes they are still used now. However, IP addresses are not location independent (even if we use IPv6) and hard to remember (especially if we use IPv6) and therefore they are not suitable for user identifiers.
Current IP telephony systems use two kinds of identifiers:
A Universal Resource Identifier (URI) uses a registered naming space to describe a resource in a location independent way. Resources are available under a variety of naming schemes and access methods including e-mail addresses (mailto), SIP identifiers (sip), H.323 identifiers (h323,RFC3508) or telephone numbers (draft-ietf-iptel-rfc2806bis-02). E-mail like identifiers have several advantages. They are easy to remember, nearly every Internet user already has an e-mail address and a new service can be added using the same identifier. The user location can be find with a Domain Name System (DNS). The disadvantage of URIs is that they are difficult or impossible to dial on some user devices (phones).
If we want to integrate a regular telephony system with IP telephony, we must deal with phone number identifiers even on the IP telephony side. The numbers are not well suitable for the Internet world relying on domain names. Therefore, the ENUM system was invented, using adapted phone numbers as domain names. We will describe ENUM in Chapter 7.