Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Table of Contents
excludeTable of Contents

Question of the Day

If you were to design a network to potentially be shared by many users, how would you control access to it?

Assigned Reading

N/A

Physical Local Area Networks

...

The data unit on a LAN is referred to as a frame (not a packet). When a frame is transmitted, it propagates over the entire physical medium. How a frame is forwarded depends on the LAN topology used. There are two popular LAN topologies:

  • Bus
  • Ring

Bus Topology

Image Added

Each end host on a bus has a transceiver that transmits and passively monitors the bus for passing transmissions. Synchronization is performed on frame headers, which contain control information. In a bus topology, every end host connected to the bus receives the header of all transmissions on the bus. If an end host receives a frame header addressed to a different end host, the rest of the frame will be ignored.

Ring Topology

Image Added

In a ring topology, each end host has a transceiver that breaks the circuit of the ring. Unlike a bus topology where each end host passively listens to the bus, each end host actively participates in the ring regardless of a frame’s destination. If an end host receives a frame with header addressed to itself, the end host will copy the frame from the ring; otherwise, the end host will forward the frame towards the next end host on the ring. In a ring topology, frames typically flow in a single predetermined direction.

...

Statistical Modeling of Network Traffic

Memoryless Traffic Source

For many applications that relay data over the network, any additional time elapsed between data transmissions does not appear to depend on the time already elapsed since the previous data was transmitted. Such a source of data is referred to as a memoryless source. Let’s derive a mathematical model for such a memoryless source. 

Consider a data source that produces data arrivals (i.e. data units) at any time starting at time t = 0. Such a source is referred to as a continuous source, since it continuously transmits data at periodic intervals.

Let the number of arrivals through time t be denoted A(t), where A(t) takes on integer values. 

Image Added

The time between the (i – 1)th and ith arrival is referred to as the interarrival time and is denoted τi. If we model each interarrival time τi as a RV, then A(t) is a random process (RP), and the arrival rate of the source is given as:

Image Added

Suppose the interarrival times {τ1, τ2, …} are independent RVs and that for each i, τi has PDF such that:

Image Added

where h(t) is the same function for all i. If this is true, then A(t) is the arrival process for a memoryless source. The remaining lifetime distribution for the RV τi is given by:

Image Added

So, a source is memoryless if the interarrival times of its arrival process have a remaining lifetime distribution:

Image Added

that does not depend on s. The arrival process of a memoryless source is an example of a continuous time Markov chain and a continuous time birth-death process.

Example 1

Image Added

Assume the PDF and CDF of the above arrival process are as follows:

Image Added

Suppose s = 1:

Image Added

For example, s = 1, t = 1:

Image Added

Suppose s = 2:

Image Added

For example, s = 2, t = 1:

Image Added

As indicated by the examples, the interarrival times depend on s. Thus, the source is not memoryless.

Example 2

Image Added

Assume the PDF and CDF of the above arrival process are as follows:

Image Added

Suppose s and t are >= 0:

Image Added

For example, s = 1, t = 1:

Image Added

For example, s = 2, t = 1:

Image Added

As indicated by the examples, the interarrival times do not depend on s. Thus, the source is memoryless.

Deriving the PDF and CDF of a Memoryless Source

What does the memoryless property tell us about the PDF of τi?

Recall that:

Image Added

So:

Image Added

If we set s = 0, we find that:

Image Added

This implies that:

Image Added

which means that we can rewrite:

Image Added

as:

Image Added

So:

Image Added

If the PDF of τi is:

Image Added

then:

Image Added

So:

Image Added

So:

Image Added

But:

Image Added

So:

Image Added

And:

Image Added

with initial condition:

Image Added

Thus:

Image Added

So, the CDF of a memoryless source is defined by:

Image Added

And the PDF of a memoryless source is defined by:

Image Added

Thus, a memoryless source will have exponentially distributed interarrival times.

Example 1

Suppose a memoryless data source has an arrival rate of four data units per second. What is the probability that the time between the first and the second arrivals is more than one second?

Image Added

So:

Image Added

Example 2

A memoryless data source with arrival rate λ starts at time 0. At t = 0, what is the density function of possible times for the first arrival?

Image Added


Image Added

At t = t1, if no arrivals have occurred, what is the density of possible times for the first arrival?

 Image Added

 

Image Added

Example 3

Suppose a memoryless source has an arrival rate of two per second. What is the probability that zero arrivals occur in the interval [t1, t2]?

If we assume that the (i - 1)th arrival is the last one before time t = t1 and that it occurs at t = t1 – t’. Then the answer is given by:

Image Added

What is the probability that exactly one arrival occurs in [t1, t2]? We know that:

Image Added

But this determines the probability that one or more arrivals occur in [t1, t2], which isn’t exactly what we’re after. We can show by solution of appropriate differential equations that:

Image Added

Thus, the arrival rate of the memoryless source is:

Image Added

which is a Poisson distribution.

Thus, it can be concluded that a continuous time, memoryless data source produces independent, exponentially distributed interarrival times. And, independent, identically distributed, exponentially distributed interarrival times result in a Poisson arrival process.

Example 4

Let A(t1) = the number of arrivals in the time interval [0, t1] for a Poisson arrival process with an arrival rate of two per second. A(t1) is a Poisson-distributed RV with PMF:

Image Added

What is the probability that A(t1) is greater than two?

Image Added

Fitting Models to Traffic Patterns

The arrival process of packet/frame can be modeled as a Poisson distribution given exponential interarrival times. If the interarrival times are fixed, such as in the audio/video example, a deterministic model is best suited. Lastly, an Erlang model best fits traffic such as the initiation of telephone calls. 

The lengths of packets/frames are commonly modeled as an exponential distribution given random traffic. However, if the traffic consists of fixed length packets/frames, then a deterministic model is more appropriate. On the other hand, a bimodal model is best suited for a mixture of short and long packets.

Poisson arrivals and exponential packet/frame lengths often provide a realistic model for interactive traffic in a computer network.

Measures of Network Performance

The performance of a computer network depends on:

  • the initial state of the network
  • the sequence of random inputs to the network (i.e. the arrivals)
  • the random internal events within the network (i.e. data processing time, frame errors, retransmissions, etc.)
  • the network architecture and protocols

Most analysis focuses on steady-state conditions, meaning the network has is exhibiting typical behavior. However, note that emphasis can also be placed on the network's response to a transient (i.e. one-time) event.

The type of performance measures to collect (i.e. means, percentiles, etc.) from the network depends on the goal of the analysis.

From a network operator's perspective, commonly sought after performance measures are throughput and utilization.

Throughput

The throughput of the network is the number of bits (or packets/frames) per second that were successfully transmitted at a given location (e.g. link or NIC) in the network. Note that any overhead for errors and retransmissions is not included in throughput. Throughput only includes the amount of data per second successfully transferred across the network as measured at a particular point. One can normalize throughput to a value between 0 and 1 by dividing by the transmission rate of the physical medium of the LAN.

Note that throughput can be computed for more than just a single physical point of a network. A "black box" network, which might consist of some unknown topology, can have its throughput measured by monitoring the total traffic per second that departs the network.

Furthermore, throughput can be computed from different perspectives in the network. If the goal is to compute the throughput of transmissions on the physical medium, then all bits from successfully transmitted frames are included in the calculation. On the other hand, if the goal is to compute the throughput of actual payload data across the network from end host to end host, the throughput computation will exclude the headers of the MAC protocol, as they would be considered overhead.

Utilization

The utilization of the network is the average fraction of time the physical medium of the LAN is busy. This includes all traffic, headers, unsuccessful transmissions, and retransmissions. Utilization can be determined by directly monitoring the physical medium of the LAN. If all transmissions on a LAN are successful, the utilization of the LAN is equal to the normalized throughput.

Related to utilization is channel efficiency, which is the utilization of the physical medium to carry actual data, excluding frame and MAC protocol overhead.