Table of Contents

exclude	Table of Contents

Question of the Day

TBD

Queuing Models

The goal of this of this module is to introduce the simplest way in which we can model the behavior of a link.

Queueing Systems

A queueing system consists of one or more servers that provide some desired service to arriving customers. The system consists of the servers with a line/queue in which these customers must wait to be served. The queue must be able to hold zero or more customers. Note that oftentimes the term queue is used to refer to both the servers and the line together. After all, it doesn't make sense to wait in a queue for nothing, right?

Such a system consists of the following dynamics, in order:

A customer arrives at the queueing system.
The customer waits in the queue for service by a server.
The customer receives service from a server.
The customer departs (ideally happily) from the queueing system.

Bringing this discussion back to what we know about LANs, a customer is a packet, message, or frame, the queue is some buffer or block of memory in which the packet can be placed while it waits for processing, a server is the link, switch, or other resource on which the packet is waiting, and a departure is a packet that is transmitted by the server.

A queueing system can often be depicted by the following diagram:

Image Added

As an example, let's consider a point-to-point topology with a single link that allows data to be requested by one end host from a remote end host. Let's assume this link has no probability of error during its operation.

Image Added

In this case, the customers are the requests from the client's application, the queue is the source buffer feeding into the NIC of the client, the service desired is the transmission of the request over the point-to-point link, and a departure is a completed transmission over the link to the remote end host.

Classification of Queues

Kendall's Notation

In order to keep track of the different types of queues, we'll adopt a notation called Kendall's notation. The general form appears as follows:

x / y / n / m / p

where:

x is the distribution of interarrival times
- M is Markovian or memoryless, which implies a Poisson arrival process (i.e. exponentially distributed interarrival times)
- D is Deterministic or constant
- Er is r-stage Erlang
- G is General (i.e. any valid distribution function)
y is the distribution of service times
- M is Markovian or memoryless, which implies exponentially distributed service times
- D is Deterministic or constant
- Er is r-stage Erlang
- G is General (i.e. any valid distribution function)
n is the number of parallel servers
- supports 1 to infinity
- greater than 1 server exploits processing parallelism
m is the number of customers allowed in the queueing system
- supports 1 to infinity
- 1 implies zero queue length and one position in server itself
- infinity implies an infinite buffer space (assume infinity if m is omitted)
- any surplus in customers must be returned to sender or discarded
p is the population size or number of customers that generate arrivals
- supports 1 to infinity
- if p is finite and p customers are in the system, then no new arrivals may occur (i.e. source exhaustion)
- if p is omitted, assume p is infinite

Queuing Discipline

The queueing discipline is the order in which customers are served from the queue or waiting line. There are a number of types of queuing disciplines, such as:

First-come, first-served (FCFS)
- process customers in order of arrival
- most common queueing discipline
- e.g. linked list data structure where 0th element is always removed
Last-come, first-served (LCFS)
- process most recent customer to have arrived
- could lead to starvation
- e.g. use of a stack via push and pop (brings back memories from x86 assembly, right? )
Round robin (RR)
- Poll from separate queues in a circular manner
Processor sharing (PS)
- Time shared access to server
- All customers served simultaneously (but a little at a time)
- e.g. a CPU on a general purpose machine where the OS schedules tasks
Random
Priority
- Multiple classes of customers
- Some classes have higher or lower priority than others
- Higher priority customers receive "better" service
- Lower priority customers receive "worse" service

Measuring the Performance of Queueing Systems

The performance of a queueing system can be assessed in a variety of ways, such as:

Mean delay
- time a customer spent in the entire system
- time a customer spent in the queue only
Mean queue length
- number of customers in the entire system
- number of customers in the queue only
Server utilization
- fraction of time the server is busy
Throughput
- the amount of data moved through the system minus any overhead per unit time

Using these metrics, we can determine useful information about the performance of links or even entire networks. In fact, a relatively small amount of information about network traffic statistics is required to ascertain network performance.

Basic Properties of Networks and Simplifying Assumptions

We can view a network as a "black box" system, which is to say we need not know any of the internal implementation details of the network. We can learn properties of this black box network though by observing characteristics, such as the entry and exit points of the network.

Image Added

To simply our models, we will assume a few things:

Any customer that arrives from outside the network may be stored inside the network but must eventually depart the network.
Customers are neither created nor destroyed in the network.
We will consider stable networks that are operating in steady state so that the statistical behavior is the same at any time and that any measuring such as averaging over long time intervals yields statistical averages.
The arrival rate of customers/packets into the network is equal to the rate of customers/packets leaving the network.

Note that for item (4), if the arrival rate is greater than the departure rate, the number of customers in the network approaches infinity, which results in an unstable system. Conversely, if the arrival rate is on average less than the departure rate, then the number of customers stored in the network approaches zero, which results in a stable system where the arrival rate is equal to the departure rate.

Little's Law

Intuitive Derivation of Little's Law

Little's Law can be used to derive properties of networks given statistical information about network traffic.

If no customers are in the network at time t = 0, then let A(t) be the number of arrivals in the interval [0, t] and let D(t) be the number of departures in the interval [0, t].

Thus, it can be said that the number of customers in the network at a given time t, N(t), is N(t) = A(t) - D(t).

Image Added

The above figure depicts the number of customers in the network, N(t), given the number of customer arrivals, A(t), and the number of customer departures, D(t), at a given time.

The measured arrival rate over the interval [0, t] is:

Image Added

and the total time spent in the system by all customers during this interval [0, t] is:

Image Added

which is the area between A(x) and D(x) over all x in the interval [0, t].

Thus, the average number of customers in the system during the interval [0, t] is:

Image Added

Consider all customers in system at time t (i.e. N(t) customers). Let R_N(t) denote the total time after t that those N(t) customers spend in the system. The average time spent in the system by all customers arriving in the interval [0, t] is:

Image Added

So:

Image Added

and:

Image Added

For the stable system we are concerned with, the following limits hold:

Image Added

and

Image Added

So:

Image Added

For our systems of interest, N is the mean number of customers in the system, λ is the arrival rate (i.e. the mean number of arrivals per second), and T is the mean time that a customer spends in the system. Thus, Little's Law can be defined as:

Image Added

Little's Law applies to any closed boundary containing system elements that do not produce or consume customers. As such, it is valid for many deterministic or probabilistic queueing systems and does not depend on any specific distribution function or service times.

Applications of Little's Law

There are many ways we can apply Little's Law in analyzing networks.

Version	Old Version 4	New Version 5
Changes made by	Ryan Izard	Ryan Izard
Saved on	Jan 25, 2016	Jan 25, 2016

Versions Compared

Key

Question of the Day

Queuing Models

Queueing Systems

Classification of Queues

Kendall's Notation

Queuing Discipline

Measuring the Performance of Queueing Systems

Basic Properties of Networks and Simplifying Assumptions

Little's Law

Intuitive Derivation of Little's Law

Applications of Little's Law

Page Comparison

Versions Compared

Key

Question of the Day

Queuing Models

Queueing Systems

Classification of Queues

Kendall's Notation

Queuing Discipline

Measuring the Performance of Queueing Systems

Basic Properties of Networks and Simplifying Assumptions

Little's Law

Intuitive Derivation of Little's Law

Applications of Little's Law