How the internet works

January 2, 2025 · 54 min read

Cloud Network Architect - Microsoft MVP

Introduction

I've been asked to explain networks to people with no experience several times and it's hard to know where to start. There's so much history and so many computer science concepts that have led us to where we are today. I've always believed that to truly understand something, you need to be able to explain it to someone else. My goal here isn't just to explain the bits that make the internet work, but also to organise my own understanding and explore areas where I've taken things on faith instead of questioning why they exist. I'll start from nothing and rebuild the internet from scratch, solving the same problems that got us where we are today.

Let's start with the cable

Our first job is to connect two computers. Computers talk in ones and zeros because they only understand if a signal is present (a one) or absent (a zero). When you connect two computers, the lowest level of connectivity is usually a cable carrying electricity, a fibre optic cable with light pulses, or radio waves with changing frequency or amplitude to show on or off. With electrical signals, you need a ground wire to complete the circuit, but otherwise you just need one path to send and another to receive.

Serial vs Parallel Communication

When connecting two computers, we can send data either serially (one bit at a time) or in parallel (multiple bits at once). Each method works better in different situations.

In serial communication, bits travel one after another along a single wire (plus ground). Think of it like a single-lane road where cars (bits) must travel in sequence:

Computer A                Computer B
    [Tx] --------------> [Rx]
    [Rx] <-------------- [Tx]
    [G]  --------------- [G]

    Data: 1 0 1 1 0
    Time: 1 2 3 4 5    (each bit sent sequentially)

Parallel communication, on the other hand, uses multiple wires to send several bits at the same time. This is like a multi-lane highway where multiple cars (bits) can travel side-by-side:

Computer A                Computer B
    [D0] --------------- [D0]
    [D1] --------------- [D1]
    [D2] --------------- [D2]
    [D3] --------------- [D3]
    [D4] --------------- [D4]
    [D5] --------------- [D5]
    [D6] --------------- [D6]
    [D7] --------------- [D7]
    [G]  --------------- [G]

    Data: 1 0 1 1 0 1 0 1
    Time: 1            (all bits sent simultaneously)

While parallel communication might seem better because it can send multiple bits at once, it has its own problems. The main issue is "clock skew" - where bits sent together might arrive slightly out of sync due to tiny differences in wire length or electrical properties. This gets worse as distances grow or speeds increase.

At higher speeds or longer distances, keeping multiple data lines synchronised in parallel communication becomes increasingly difficult. This is why modern high-speed communications (like USB 3.0, SATA, and PCIe) use multiple serial lanes rather than parallel communications. They achieve high throughput by running the serial communications at very high frequencies rather than sending bits in parallel.

Serial communication, though it seems slower at first glance, can actually reach higher speeds over longer distances because it only needs to keep timing on a single data line. That's why most modern high-speed computer interfaces have switched from parallel to serial communications, often using multiple serial lanes when they need more bandwidth.

The balance between serial and parallel communications shows how network engineering often means trading theoretical performance for practical solutions. As we continue discussing networking, we'll mainly focus on serial communications since they're the foundation of most modern network interfaces.

Serial clocks

The next problem to solve is making sure both computers know when ones and zeros start and end. Serial connections need both computers to know how fast data is flowing. In a simple system, both computers can be set to the same rate, like 9600 bits per second, and they sample the data at that rate, which usually works fine. For faster and more accurate transfers, we need another wire with a clock signal. This clock signal works like a metronome, giving a timing reference that keeps sender and receiver in sync. This synchronization makes sure bits are sampled at just the right moments to be interpreted correctly.

The baud rate, measured in symbols per second, represents how many signal changes can occur on the communication channel in one second. Baud rate was historically equivalent to bits per second in early systems where each symbol represented one bit, but some communication schemes encode more than one bit in each symbol. For example, a link that uses four different voltage levels to represent two bits per symbol and operates at 1200 baud can achieve a data rate of 2400 bits per second. The point to take away is that while bit rate and baud are often used interchangeably, they are not the same.
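The baud and bit rate relationship is simple enough to write as a calculation. This is only a sketch: it assumes every symbol carries the same number of bits and that the number of signal levels is a power of two, and the function name is my own.

```python
def bit_rate(baud: int, signal_levels: int) -> int:
    """Bits per second for a line sending `baud` symbols per second,
    where each symbol is one of `signal_levels` distinct levels."""
    # For a power of two, bit_length() - 1 is log2 (4 levels -> 2 bits).
    bits_per_symbol = signal_levels.bit_length() - 1
    return baud * bits_per_symbol

print(bit_rate(1200, 4))  # four voltage levels, two bits per symbol
print(bit_rate(9600, 2))  # two levels, so baud and bit rate are equal
```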

The relationship between clock signals and baud rate is very close - one computer must create a clock signal that matches the baud rate so both computers can sample their incoming data at just the right points between signal changes.

We can send numbers, what about letters?

We've established that computers talk in ones and zeros, but humans prefer text. ASCII (American Standard Code for Information Interchange) solves this by using 7 bits to represent 128 different characters. Each character maps to a specific binary number - for example, the letter 'A' is represented as binary 1000001 (decimal 65).

ASCII's use of 7 bits rather than 8 reflects its 1960s origins, when the 8th bit was reserved for parity - a simple error detection scheme. The parity bit would be set to make the total number of 1s either odd or even, allowing detection of single-bit transmission errors. When 8-bit bytes became standard, various "extended ASCII" encodings emerged using the extra bit to add 128 more characters, though these extensions weren't standardised.

Example byte with even parity:
Letter 'A' = 1000001  (two 1s)
Parity bit = 0        (total 1s is already even)
Final byte  = 01000001
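Even parity is easy to compute by counting the 1 bits. The sketch below assumes the parity bit goes in the most significant position of the byte, as in the example above; the function names are illustrative.

```python
def add_even_parity(char: str) -> str:
    """Return the 8-bit string for a 7-bit ASCII character with an
    even-parity bit in the most significant position."""
    code = ord(char)
    if code > 127:
        raise ValueError("not a 7-bit ASCII character")
    ones = bin(code).count("1")
    parity = ones % 2  # 1 only when the count of 1s is odd
    return f"{parity}{code:07b}"

def parity_ok(byte_bits: str) -> bool:
    """A received byte passes the check when its total number of 1s is even."""
    return byte_bits.count("1") % 2 == 0

for letter in ("A", "C"):
    print(letter, add_even_parity(letter))
```

Flipping any single bit of a byte makes `parity_ok` return False, but flipping two bits goes unnoticed, which is why parity only catches single-bit errors.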

ASCII was designed mainly for English text, with its 128 characters covering the Latin alphabet (both uppercase and lowercase), numbers, punctuation marks, and control characters like carriage return and line feed. This worked fine for English-speaking countries but caused problems for languages with different alphabets or character sets.

Enter Unicode

As computing spread worldwide, ASCII's limits became obvious. Unicode was created to handle text from all the world's writing systems. While ASCII uses just 7 bits, Unicode encodings can use multiple bytes per character, giving room for over a million different characters instead of just 128.

Character encoding comparison:
ASCII:    'A' = 1000001                    (7 bits)
UTF-8:    'A' = 01000001                   (8 bits)
UTF-8:    '世' = 11100100 10111000 10010110 (24 bits)

For backwards compatibility, Unicode was cleverly designed so that the first 128 characters match ASCII exactly. This means that plain English text looks identical in both ASCII and UTF-8 (the most common Unicode encoding), which helped smooth the transition. This is why protocols like HTTP headers still use ASCII - they can be processed by both old and new systems without any confusion.
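You can check the encodings yourself with Python's built-in `str.encode`. The helper name `utf8_bits` is just something I made up for this sketch.

```python
def utf8_bits(char: str) -> str:
    """Show the UTF-8 encoding of a character as space-separated bytes in binary."""
    return " ".join(f"{byte:08b}" for byte in char.encode("utf-8"))

for ch in ("A", "世"):
    print(ch, utf8_bits(ch))

# Plain ASCII text produces exactly the same bytes in both encodings.
same = "GET / HTTP/1.1".encode("ascii") == "GET / HTTP/1.1".encode("utf-8")
print(same)
```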

We're still moving from ASCII to Unicode. While modern apps and websites mostly use Unicode, many networking protocols and older systems still use ASCII, especially where backward compatibility matters or where only simple English text is needed.

Control characters

The first 32 ASCII values (0-31) are control characters designed for teletype machines and early computer terminals:

00 NUL - Null character, used as a string terminator
01 SOH - Start of Heading, marking message header
02 STX - Start of Text, marking message body start
03 ETX - End of Text, marking message body end
04 EOT - End of Transmission
05 ENQ - Enquiry, request for response
06 ACK - Acknowledge, positive response
07 BEL - Bell, triggers audible alert
08 BS  - Backspace, move cursor back
09 TAB - Horizontal Tab
0A LF  - Line Feed, move down one line
0B VT  - Vertical Tab
0C FF  - Form Feed, advance to next page
0D CR  - Carriage Return, move cursor to line start
0E SO  - Shift Out, switch to alternate character set
0F SI  - Shift In, return to standard character set
10 DLE - Data Link Escape
11-14 DC1-DC4 - Device Control (DC1/XON and DC3/XOFF for flow control)
15 NAK - Negative Acknowledge
16 SYN - Synchronous Idle
17 ETB - End of Transmission Block
18-1F - CAN, EM, SUB, ESC and the FS/GS/RS/US file, group, record and unit separators

Many are still used today - particularly TAB, LF, CR for text formatting and XON/XOFF for flow control in serial communications.

The full ASCII table is handy when examining low-level protocol captures.

Dec Hex ASCII   Dec Hex ASCII   Dec Hex ASCII   Dec Hex ASCII
--- --- -----   --- --- -----   --- --- -----   --- --- -----
000 00  NUL     032 20  SPACE   064 40  @       096 60  `
001 01  SOH     033 21  !       065 41  A       097 61  a
002 02  STX     034 22  "       066 42  B       098 62  b
003 03  ETX     035 23  #       067 43  C       099 63  c
004 04  EOT     036 24  $       068 44  D       100 64  d
005 05  ENQ     037 25  %       069 45  E       101 65  e
006 06  ACK     038 26  &       070 46  F       102 66  f
007 07  BEL     039 27  '       071 47  G       103 67  g
008 08  BS      040 28  (       072 48  H       104 68  h
009 09  TAB     041 29  )       073 49  I       105 69  i
010 0A  LF      042 2A  *       074 4A  J       106 6A  j
011 0B  VT      043 2B  +       075 4B  K       107 6B  k
012 0C  FF      044 2C  ,       076 4C  L       108 6C  l
013 0D  CR      045 2D  -       077 4D  M       109 6D  m
014 0E  SO      046 2E  .       078 4E  N       110 6E  n
015 0F  SI      047 2F  /       079 4F  O       111 6F  o
016 10  DLE     048 30  0       080 50  P       112 70  p
017 11  DC1     049 31  1       081 51  Q       113 71  q
018 12  DC2     050 32  2       082 52  R       114 72  r
019 13  DC3     051 33  3       083 53  S       115 73  s
020 14  DC4     052 34  4       084 54  T       116 74  t
021 15  NAK     053 35  5       085 55  U       117 75  u
022 16  SYN     054 36  6       086 56  V       118 76  v
023 17  ETB     055 37  7       087 57  W       119 77  w
024 18  CAN     056 38  8       088 58  X       120 78  x
025 19  EM      057 39  9       089 59  Y       121 79  y
026 1A  SUB     058 3A  :       090 5A  Z       122 7A  z
027 1B  ESC     059 3B  ;       091 5B  [       123 7B  {
028 1C  FS      060 3C  <       092 5C  \       124 7C  |
029 1D  GS      061 3D  =       093 5D  ]       125 7D  }
030 1E  RS      062 3E  >       094 5E  ^       126 7E  ~
031 1F  US      063 3F  ?       095 5F  _       127 7F  DEL

That's two computers, how about n?

Connecting two computers with a serial connection works well, but what if you want to add a third node? The simplest solution is a full mesh topology where each computer has a direct connection to both other computers.

This approach doesn't scale well. You need one link for two nodes, three links for three nodes, but six links for four nodes. With five computers in a full mesh, you need ten links.

The formula for connections between n computer nodes is n(n-1)/2. The key point is that n is multiplied by n (well, n-1, but that's nitpicking), so the number of links grows quadratically: doubling the number of computers roughly quadruples the number of links. That becomes impractical quickly - 50 computers in a small office would need 1,225 connections to be set up, configured and maintained.
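To see how quickly the link count climbs, here is the formula as a small function. The function name is illustrative.

```python
def full_mesh_links(nodes: int) -> int:
    """Number of point-to-point links needed to connect every pair of nodes."""
    return nodes * (nodes - 1) // 2

for n in (2, 3, 4, 5, 50, 100):
    print(f"{n:>3} nodes need {full_mesh_links(n):>5} links")
```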

Addressing the problem

Since full mesh networks don't scale well, we need to connect multiple devices on a shared wire. But first, we need to solve another problem: making sure each device gets only the data meant for it. This is where addressing begins.

If we have a shared wire connecting all computers (perhaps using T-shaped splitters), any computer can send data that all others receive. By adding a destination address to each data chunk, and having each computer know its own address, devices can ignore data not meant for them. This might sound crazy in today's security-conscious world, but networks were much more trusting in the early days.

Single-wire setups (bus topology) have problems. A cable break anywhere splits your network into two separate networks that can't communicate. A ring topology helps somewhat because a single break just converts it to a bus topology.

Star or hub and spoke topology is better - if one link fails, only that device is affected, not the entire network. The hub is a simple electrical repeater that receives signals from one computer and forwards them to all others. Each computer checks if the destination address matches its own - if yes, it processes the data; if not, it ignores it.

Frames and MAC addresses

To know where data for each destination starts and ends, we organize it in "frames" - discrete blocks with clear start and end points. Data frames have headers containing source and destination addresses, plus an end marker showing where the data payload finishes. These addresses are just numbers (like everything in networks), but they're very large numbers with trillions of possibilities, ensuring they're unique.

A MAC address (short for medium access control address or media access control address) is a unique identifier assigned to a network interface. It comes from a 48-bit address space containing over 281 trillion possible MAC addresses. Blocks of addresses were historically allocated to network interface vendors, who hardcode one onto each interface they make; due to the sheer quantity of available addresses it's extremely unlikely that two devices on a network will ever have the same MAC address.

With our data in frames, all computers connected to a hub, and each one uniquely addressed, we can start scaling our network. We can even connect hubs to other hubs since data sent by one hub will be repeated to all connected hubs. But we must avoid creating loops of hubs, as they aren't smart enough to recognize when a data frame is caught in an endless loop, being repeated by each hub and broadcast to every computer on the network. This is just one way network congestion can happen in hub and spoke networks.

Collisions

As our hub and spoke network grows, we face another problem: collisions. Computers need to take turns sending data. They do this by listening for gaps in data flow at the end of frames before sending their own. Sometimes, two computers will both detect the end of a frame and try to send their own frames simultaneously.

When this happens, the hub (being just a simple electrical repeater) can't buffer one frame while transmitting the other. Instead, when hubs detect multiple senders at once, they reject both transmissions. The sending computers then stop, wait for a random amount of time (to avoid colliding again), and try retransmitting.
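The random wait is what breaks the tie. Here is a toy model of it, assuming time is divided into numbered slots and each colliding computer picks one at random; whoever picks the earliest slot alone gets to send first. The slot count and function name are my own choices, and real Ethernet uses a more refined backoff scheme than this.

```python
import random

def contend(stations, rng, max_slots=8):
    """Repeat random backoff until exactly one station picks the earliest slot.
    Returns the station that sends first and how many rounds it took."""
    rounds = 0
    while True:
        rounds += 1
        waits = {name: rng.randrange(max_slots) for name in stations}
        earliest = min(waits.values())
        first = [name for name, slot in waits.items() if slot == earliest]
        if len(first) == 1:
            return first[0], rounds
        # Two or more stations chose the same earliest slot: another collision.

winner, rounds = contend(["A", "B"], random.Random(1))
print(f"{winner} transmits first after {rounds} round(s) of backoff")
```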

As networks grow, collisions become more likely. Performance also drops because lots of data gets sent to computers that simply discard it because it's not addressed to them.

Switching to something cleverer

Instead of simple hubs that just repeat electrical signals, we need a smarter device that can:

Look at the destination MAC address
Compare it to a table in memory
Send data only through the correct port where the destination computer is connected

This raises new challenges. First, we need to build that table mapping MAC addresses to ports. The simplest way is to examine the source MAC address of incoming frames and note which port they arrived on. When we see a destination MAC address we don't recognize, we can "flood" the frame to all ports and see which one gets a response.

This switch needs both a processor and memory, transforming it from a dumb repeater into an actual computing device. With memory, it can temporarily store frames in a buffer while it consults its MAC table to determine where to send them. These buffers also help manage congestion when multiple incoming interfaces try to send to the same outgoing interface.
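The learn-and-flood behaviour fits in a few lines. This is a minimal sketch that ignores buffering, table ageing and broadcast addresses; the class and method names are hypothetical.

```python
class LearningSwitch:
    def __init__(self, port_count: int):
        self.ports = list(range(port_count))
        self.mac_table = {}  # MAC address -> port number

    def receive(self, in_port: int, src_mac: str, dst_mac: str) -> list:
        """Handle one frame and return the ports it is sent out of."""
        # Learn: the source address is reachable through the arrival port.
        self.mac_table[src_mac] = in_port
        out_port = self.mac_table.get(dst_mac)
        if out_port is None:
            # Unknown destination: flood to every port except the arrival port.
            return [p for p in self.ports if p != in_port]
        if out_port == in_port:
            return []  # destination is on the arrival port, nothing to forward
        return [out_port]

switch = LearningSwitch(4)
print(switch.receive(0, "aa:aa", "bb:bb"))  # bb:bb unknown, so flood
print(switch.receive(1, "bb:bb", "aa:aa"))  # aa:aa was learned on port 0
print(switch.receive(0, "aa:aa", "bb:bb"))  # bb:bb is now known on port 1
```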

Congestion

Congestion occurs when multiple computers try to send data to the same computer simultaneously. If two computers are sending at their maximum speed, the receiving computer would need twice the capacity to handle it all.

Sometimes we solve this by increasing the receiving interface's capacity, but more commonly we buffer the incoming traffic and send it out as fast as possible until the buffer clears. When congestion persists for extended periods, we need to prioritize certain traffic types over others.

MAC tables and why they don't scale

Managing a table that maps MAC addresses to ports gets increasingly complex as networks grow. Think about trying to track the MAC address of every device on the internet - billions of them - and knowing which port to use to reach each one. Or imagine flooding the entire internet with a frame just to find one device!

The practical limit to your network size depends on two factors: how big your MAC address table can be, and how efficiently you can search through it. This creates a fundamental scalability problem.

MAC tables work like a phone book lookup. The simplest approach is checking every entry one by one - that's n comparisons where n is the size of your MAC table. You can use smarter algorithms with a sorted table, but then you need to re-sort it whenever the network changes. This explains why network loops cause such havoc for switches - they'll see the same MAC address coming from multiple ports and constantly need to update and re-sort their tables. Even the most efficient sorting algorithms become resource-intensive when run repeatedly.

Finding the router

Since MAC address tables limit our network size, we need to think of networks as separate segments that can be connected into larger networks. To make this work, we need a more organized addressing system that groups addresses into blocks rather than tracking each individual address.

The Internet Protocol (IP) does exactly this. It organizes data into packets (similar to frames) with source and destination addresses.

A helpful analogy: think of an IP packet as a letter addressed to someone in another city, and the frame as the postman's bag carrying it to the post office. You write the address on your letter (packet) and give it to the postman, who puts it in their bag (frame). The bag goes to a sorting office (router) where your letter is taken out, examined, and placed in another bag heading to the right city. When it arrives there, your letter is checked again, matched against delivery routes, and placed in the appropriate postbag for final delivery.

This introduces two key elements: the router device and the Internet Protocol (IP) addressing scheme. IP addresses are simply 32-bit numbers, giving us a range from 0 to 4,294,967,295. Since decimal notation makes these huge numbers hard to work with, we split the 32 bits into four 8-bit chunks (octets) and represent each in decimal form.

If we take a dotted decimal IP address like 192.168.0.1 and convert each octet to binary we get 11000000.10101000.00000000.00000001. This is actually a representation of the single 32 bit number 11000000101010000000000000000001 or 3,232,235,521 in decimal. 
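The conversion between dotted decimal and the underlying 32-bit number is just shifting and masking. A minimal sketch with illustrative function names (Python's standard `ipaddress` module can do the same job):

```python
def ip_to_int(dotted: str) -> int:
    """Pack four decimal octets into one 32-bit integer."""
    octets = [int(part) for part in dotted.split(".")]
    if len(octets) != 4 or any(not 0 <= o <= 255 for o in octets):
        raise ValueError(f"not a dotted decimal IPv4 address: {dotted}")
    value = 0
    for octet in octets:
        value = (value << 8) | octet  # make room for 8 bits, then add the octet
    return value

def int_to_ip(value: int) -> str:
    """Split a 32-bit integer back into four decimal octets."""
    return ".".join(str((value >> shift) & 0xFF) for shift in (24, 16, 8, 0))

number = ip_to_int("192.168.0.1")
print(number, f"{number:032b}", int_to_ip(number))
```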

We can group IP addresses into networks by dividing the binary digits into two parts: a network (or subnet) portion and a host portion. The network part identifies which network the host belongs to, while the host part identifies the specific computer within that network.

This structure means routers only need to maintain routes to networks, not to individual hosts. Each route covers all hosts within that network. Routers still need to know the MAC addresses corresponding to IPs in their local subnets, but we'll cover that shortly.

IP addressing originally used a class system, with network size determined by the class. Today, we use CIDR (Classless Inter-Domain Routing) notation instead, which shows exactly how many bits are used for the network portion. This more flexible approach allows networks of any size, not just predetermined class sizes.

Let's look at a simple example: a CIDR network of 10.0.0.0/24. The "/24" means the first 24 bits (conveniently the first three octets) are the network portion, with the last 8 bits (the last octet) being the host portion.

This setup keeps the first 24 bits constant (10.0.0), while we can use the remaining 8 bits to address our devices. With 8 bits, we get numbers from 0 to 255, but we reserve two addresses for special purposes:

0 (10.0.0.0) is the network address itself
255 (10.0.0.255) is the broadcast address, which sends data to all hosts on the segment

This leaves us 254 usable addresses (10.0.0.1 through 10.0.0.254) for our actual devices. With different CIDR sizes, things get more complex when the network/host boundary doesn't align with octet boundaries, but the same principles apply.
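Python's standard `ipaddress` module will do this arithmetic for any prefix length, including ones that don't fall on an octet boundary. The `describe` helper and the 10.0.0.64/26 example network are my own additions for illustration.

```python
import ipaddress

def describe(cidr: str) -> dict:
    """Summarise a CIDR block: network, broadcast, mask and usable host range."""
    net = ipaddress.ip_network(cidr)
    hosts = list(net.hosts())  # excludes the network and broadcast addresses
    return {
        "network": str(net.network_address),
        "broadcast": str(net.broadcast_address),
        "mask": str(net.netmask),
        "first_host": str(hosts[0]),
        "last_host": str(hosts[-1]),
        "usable": len(hosts),
    }

print(describe("10.0.0.0/24"))
print(describe("10.0.0.64/26"))  # boundary falls inside the last octet
```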

We represent IPv4 addresses in dotted decimal, but MAC and IPv6 addresses use hexadecimal notation. Why the difference? It's partly historical and partly practical. Decimal works fine for IPv4 because each octet is just 0-255, which is easy for humans to comprehend. But MAC addresses (48 bits) and IPv6 addresses (128 bits) are much longer, making hexadecimal more efficient. A 16-bit block in IPv6 could be up to 65,535 in decimal, but just four hex digits.

Hex is particularly good for byte-oriented data because one byte (8 bits) maps perfectly to two hex digits. Each 4-bit group (called a nibble) converts to a single hex character 0-F. This makes binary data easier to read and work with, which is why network tools typically show packet captures in hex format.

As IP packets travel across networks, they carry addressing information in a structured header. This header sits in front of the actual data and contains everything routers need to route the packet correctly. Let's peek inside an IP packet to see how these addresses work:

The source and destination addresses each take up 32 bits - exactly one IP address worth of space. But the header contains much more:

A Time to Live field prevents packets from circulating forever in routing loops
A Protocol field identifies what type of data follows (TCP, UDP, etc.)
Several fields handle large messages that need splitting across multiple packets

The Version field tells us which IP version we're using (4 for IPv4). The IHL (Internet Header Length) indicates how long the header is, since it can vary with optional fields. The Type of Service field (now called Differentiated Services) lets us flag packets needing special handling, like voice calls that need minimal delay.

As a packet travels, each router:

Examines the header
Uses the destination address to decide where to send it next
Decreases the Time to Live value by one
Updates the checksum

If Time to Live reaches zero, the router destroys the packet and sends an error message back to the source. This prevents packets from bouncing around endlessly when routing problems occur.
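The Time to Live and checksum steps can be sketched directly on the raw header bytes. This assumes a 20-byte IPv4 header with no options, where byte 8 is the Time to Live and bytes 10-11 hold the checksum. The sample header is made-up illustrative data, and the function names are hypothetical.

```python
from typing import Optional

def ipv4_checksum(header: bytes) -> int:
    """Ones' complement of the ones' complement sum of the 16-bit header words."""
    total = 0
    for i in range(0, len(header), 2):
        total += (header[i] << 8) | header[i + 1]
    while total >> 16:  # fold any carry back into the low 16 bits
        total = (total & 0xFFFF) + (total >> 16)
    return ~total & 0xFFFF

def forward(header: bytes) -> Optional[bytes]:
    """Decrement the TTL and refresh the checksum, or return None to drop."""
    ttl = header[8]
    if ttl <= 1:
        return None  # TTL would reach zero: drop and report back to the source
    updated = bytearray(header)
    updated[8] = ttl - 1
    updated[10:12] = b"\x00\x00"  # checksum is computed with its own field zeroed
    updated[10:12] = ipv4_checksum(bytes(updated)).to_bytes(2, "big")
    return bytes(updated)

# Illustrative header: TTL 0x40, protocol 0x11 (UDP), 192.168.0.1 -> 192.168.0.199
SAMPLE = bytes.fromhex("45000073 00004000 4011b861 c0a80001 c0a800c7")
print(forward(SAMPLE).hex())
```

A handy property of this checksum is that running it over a header with a valid checksum already in place gives zero, which is how a receiver verifies it.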

With our IP addressing scheme, we need one more piece of information: where to send packets that aren't on our local network. While routers maintain full routing tables, end hosts just need to know their default gateway - an address on their local network to which they send any traffic destined for other networks.

Going back to our postal analogy: if you give your postman a letter addressed to someone on his route, he'll deliver it directly. For all other destinations, he takes it to the local sorting office - your default gateway to the wider postal system.

In an example where our local host has an address of 10.0.0.1 and is on network 10.0.0.0/24, the network portion is 10.0.0.0 and the host portion is 1. The computer works that out using a subnet mask, which consists of 24 ones (from the 24 in the CIDR notation) followed by 8 zeros to make up a total of 32 bits. A 24-bit subnet mask is 11111111.11111111.11111111.00000000 or 255.255.255.0.
If we use a logical AND on the subnet mask and the local host address 10.0.0.1, we get the bits that are set in both:

11111111.11111111.11111111.00000000 AND
00001010.00000000.00000000.00000001
    ^ ^
gives
00001010.00000000.00000000.00000000

Now we do a logical AND with the subnet mask and the destination address 10.1.0.1 (00001010.00000001.00000000.00000001):

11111111.11111111.11111111.00000000 AND
00001010.00000001.00000000.00000001
    ^ ^         ^
gives
00001010.00000001.00000000.00000000

If we now compare those two network addresses, 10.0.0.0 and 10.1.0.0, they are different, so we know we need to send the packet to the default gateway.
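The same decision in code, as a sketch. The gateway address 10.0.0.254 is a made-up example and the function names are illustrative.

```python
def to_int(dotted: str) -> int:
    """Pack a dotted decimal IPv4 address into a 32-bit integer."""
    value = 0
    for part in dotted.split("."):
        value = (value << 8) | int(part)
    return value

def network_of(address: str, prefix_len: int) -> int:
    """AND the address with a mask of `prefix_len` ones followed by zeros."""
    mask = (0xFFFFFFFF << (32 - prefix_len)) & 0xFFFFFFFF
    return to_int(address) & mask

def next_hop(source: str, destination: str, prefix_len: int, gateway: str) -> str:
    """Deliver directly if both addresses share a network, else use the gateway."""
    if network_of(source, prefix_len) == network_of(destination, prefix_len):
        return destination
    return gateway

print(next_hop("10.0.0.1", "10.0.0.200", 24, "10.0.0.254"))  # same network
print(next_hop("10.0.0.1", "10.1.0.1", 24, "10.0.0.254"))    # different network
```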

The Address Configuration Problem

As our networks grow, we encounter a new scaling problem: how do we ensure every device gets the correct network configuration? For a network to function, each device needs:

A unique IP address (to avoid conflicts)
The correct subnet mask (to know what's local vs remote)
A default gateway address (to reach other networks)
DNS server addresses (to resolve names to addresses)

Manual configuration quickly becomes unmanageable:

Network Admin's Challenge:
100 devices = 400 configuration items to track
1000 devices = 4000 configuration items to track
10000 devices = 40000 configuration items to track

Even in a small office with 50 devices, manual configuration presents several problems:

Address conflicts when two devices are accidentally given the same IP
Configuration errors when subnet masks don't match
Lost connectivity when gateway addresses are typed incorrectly
Wasted addresses when devices leave the network but their IPs aren't reassigned
New devices can't connect until an administrator manually configures them

DHCP solves these problems through automation. It maintains a central pool of addresses and dynamically assigns them when devices join the network:

Address Pool Management:
+-------------------+
| 192.168.1.1-50    | → Accounting Department
+-------------------+
| 192.168.1.51-100  | → Engineering Department
+-------------------+
| 192.168.1.101-150 | → Available Pool
+-------------------+
| 192.168.1.151-200 | → Reserved for Printers
+-------------------+

The beauty of DHCP is that it solves both immediate and future problems. Not only does it handle initial configuration, but it also manages address reuse through lease times. When a device leaves the network, its address automatically returns to the pool after the lease expires.
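Here is a toy model of the lease idea, just to show how addresses flow back into the pool. It is not the DHCP protocol itself (there are no messages on the wire here); the class name, the use of a plain number for the current time, and the one-hour lease are all assumptions for the sketch.

```python
import ipaddress

class LeasePool:
    def __init__(self, first: str, last: str, lease_seconds: int):
        start = int(ipaddress.IPv4Address(first))
        end = int(ipaddress.IPv4Address(last))
        self.free = [str(ipaddress.IPv4Address(n)) for n in range(start, end + 1)]
        self.leases = {}  # client MAC -> (ip, expiry time)
        self.lease_seconds = lease_seconds

    def _expire(self, now: float) -> None:
        """Return addresses whose lease has run out to the free list."""
        for mac, (ip, expiry) in list(self.leases.items()):
            if expiry <= now:
                del self.leases[mac]
                self.free.append(ip)

    def request(self, mac: str, now: float):
        """Hand out (or renew) an address for this client, or None if exhausted."""
        self._expire(now)
        if mac in self.leases:
            ip, _ = self.leases[mac]  # renewal keeps the same address
        elif self.free:
            ip = self.free.pop(0)
        else:
            return None
        self.leases[mac] = (ip, now + self.lease_seconds)
        return ip

pool = LeasePool("192.168.1.101", "192.168.1.102", lease_seconds=3600)
print(pool.request("aa:aa", now=0))
print(pool.request("bb:bb", now=10))
print(pool.request("cc:cc", now=20))    # pool is empty
print(pool.request("cc:cc", now=3601))  # first lease has expired and is reused
```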

The Address Resolution Dilemma

Our network now has two addressing systems that need to work together:

IP addresses (for logical network organisation)
MAC addresses (for physical frame delivery)

This creates a problem: when a device wants to send data to an IP address on its local network, how does it know which MAC address to put in the frame? Consider this scenario:

Device A wants to send to 192.168.1.100
BUT
Frame needs a MAC address destination
AND
Device A only knows the IP address

Without a solution, every device would need a manually maintained table mapping every local IP to its MAC address - essentially the same scaling problem we had with MAC address tables in switches. Moreover, these mappings would need constant updates as devices join, leave, or change addresses.

ARP solves this through a dynamic discovery process:

Problem: Who has 192.168.1.100?
+-----------------+     Broadcast     +------------------+
| Device A        |----------------->| Every Device     |
| "I need to find |                  | "Is this my IP?" |
|  192.168.1.100" |                  |                  |
+-----------------+                  +------------------+
                                           |
                        Unicast Response   |
                  <------------------------|
                  "Yes, that's me, here's 
                   my MAC address"

ARP's elegant solution is to let devices discover mappings as needed and cache them temporarily. This addresses both the scaling problem (you only cache mappings you actually need) and the maintenance problem (cached entries expire automatically).
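The cache side of this can be sketched as a dictionary with expiry times. The 60-second lifetime and the class and method names are assumptions for illustration; real systems choose their own timeouts.

```python
class ArpCache:
    def __init__(self, ttl_seconds: int = 60):
        self.ttl = ttl_seconds
        self.entries = {}  # IP address -> (MAC address, expiry time)

    def learn(self, ip: str, mac: str, now: float) -> None:
        """Record a mapping seen in an ARP message, replacing any older one."""
        self.entries[ip] = (mac, now + self.ttl)

    def lookup(self, ip: str, now: float):
        """Return the cached MAC, or None if we need to broadcast a request."""
        entry = self.entries.get(ip)
        if entry is None:
            return None
        mac, expiry = entry
        if expiry <= now:
            del self.entries[ip]  # stale entry: forget it and ask again
            return None
        return mac

cache = ArpCache()
print(cache.lookup("192.168.1.100", now=0))   # nothing cached yet
cache.learn("192.168.1.100", "00:11:22:33:44:55", now=0)
print(cache.lookup("192.168.1.100", now=10))  # served from the cache
print(cache.lookup("192.168.1.100", now=61))  # expired, must ask again
```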

The real genius of ARP is how it handles network changes:

Scenario: An IP Address Moves to Different Hardware
Before:
  192.168.1.100 → MAC: 00:11:22:33:44:55

After the Address Moves:
  192.168.1.100 → MAC: AA:BB:CC:DD:EE:FF

Solution: Gratuitous ARP
  "Hey everyone, 192.168.1.100 is now at MAC AA:BB:CC:DD:EE:FF"

Both DHCP and ARP are examples of how networking protocols solve complex management problems through automation and discovery rather than manual configuration. They're critical pieces that make modern networks scalable and self-managing.

Finding the route

Once a packet reaches the router, the router compares its destination address to the route table. The biggest difference between the route table and the MAC table is that the route table holds the path to a whole network as a single entry without having to list every single host. The network address that we discussed earlier, together with the CIDR prefix length, is used to pick the right path out of all the available paths. So long as each router along the path knows the way to the network containing the host, they can pass the packet along, wrapping it in a fresh frame for each hop, until it reaches the destination network's router, where ARP allows it to be sent directly to the host computer's MAC address.
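A route lookup can be sketched with the standard `ipaddress` module. I'm assuming the usual rule that when several entries match, the most specific one (the longest prefix) wins; the route entries and next-hop names below are invented for the example.

```python
import ipaddress

ROUTES = [
    ("0.0.0.0/0", "default gateway"),  # matches everything
    ("10.0.0.0/8", "router A"),
    ("10.1.0.0/16", "router B"),
]

def lookup(destination: str, routes=ROUTES):
    """Return the next hop of the matching route with the longest prefix."""
    address = ipaddress.ip_address(destination)
    best = None
    for cidr, hop in routes:
        network = ipaddress.ip_network(cidr)
        if address in network and (best is None or network.prefixlen > best[0].prefixlen):
            best = (network, hop)
    return best[1] if best else None

for target in ("10.1.0.1", "10.2.3.4", "8.8.8.8"):
    print(target, "->", lookup(target))
```

Three entries cover every possible destination here, which is the point: the table size depends on the number of networks, not the number of hosts.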

Static routing

We can easily build up routing tables that span a large network by working out the next hop from each router. Take the example below where an organisation has networked computers in cities across the world connected by underground or undersea cables.

We can simplify the diagram and remove the geography and give each router a name.

The routing table for each site's router would be as follows:

San Francisco (SF):
CH-->NY
NY-->NY    (direct)
ED-->NY    (via NY to LN to ED)
LN-->NY
TK-->NY    (via NY to LN to TK)

Chicago (CH):
SF-->NY
NY-->NY    (direct)
ED-->NY    (via NY to LN to ED)
LN-->NY
TK-->NY    (via NY to LN to TK)

New York (NY):
SF-->SF    (direct)
CH-->CH    (direct)
ED-->LN
LN-->LN    (direct)
TK-->LN

London (LN):
SF-->NY
CH-->NY
NY-->NY    (direct)
ED-->ED    (direct)
TK-->TK    (direct)

Edinburgh (ED):
SF-->LN
CH-->LN
NY-->LN
LN-->LN    (direct)
TK-->LN

Tokyo (TK):
SF-->LN
CH-->LN
NY-->LN
ED-->LN
LN-->LN    (direct)

If you trace the path of a packet from Chicago to Tokyo you can see at each step the router knows where to send the packet but it doesn't know the whole route. You can simplify things even further with stub networks like Edinburgh, Chicago and Tokyo because all traffic from them goes to the same next hop, so a single default route replaces the whole table:

San Francisco (SF):
CH-->NY
NY-->NY    (direct)
ED-->NY    (via NY to LN to ED)
LN-->NY
TK-->NY    (via NY to LN to TK)

Chicago (CH):
All traffic (default route) -->NY

New York (NY):
SF-->SF    (direct)
CH-->CH    (direct)
ED-->LN
LN-->LN    (direct)
TK-->LN

London (LN):
SF-->NY
CH-->NY
NY-->NY    (direct)
ED-->ED    (direct)
TK-->TK    (direct)

Edinburgh (ED):
All traffic (default route) -->LN


Tokyo (TK):
All traffic (default route) -->LN
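To trace a packet through these tables, we can write them as dictionaries and follow the next hops. The `trace` function and its hop limit (a stand-in for Time to Live) are my own additions for this sketch.

```python
ROUTES = {
    "SF": {"CH": "NY", "NY": "NY", "ED": "NY", "LN": "NY", "TK": "NY"},
    "CH": {"default": "NY"},
    "NY": {"SF": "SF", "CH": "CH", "ED": "LN", "LN": "LN", "TK": "LN"},
    "LN": {"SF": "NY", "CH": "NY", "NY": "NY", "ED": "ED", "TK": "TK"},
    "ED": {"default": "LN"},
    "TK": {"default": "LN"},
}

def trace(source: str, destination: str, routes=ROUTES, max_hops: int = 16) -> list:
    """Follow next-hop entries one router at a time and return the path taken."""
    path = [source]
    current = source
    while current != destination:
        if len(path) > max_hops:
            raise RuntimeError("hop limit exceeded, routing loop suspected")
        table = routes[current]
        # Each router only knows the next hop, never the whole path.
        hop = table.get(destination, table.get("default"))
        if hop is None:
            raise LookupError(f"{current} has no route to {destination}")
        path.append(hop)
        current = hop
    return path

print(" -> ".join(trace("CH", "TK")))
print(" -> ".join(trace("ED", "SF")))
```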

This is great until we want to make changes, or perhaps changes are forced upon us by unexpected failures. Let's say we add an undersea cable crossing the Pacific to connect Tokyo and San Francisco. Many such cables exist and we might want to use it for communications between those two sites. We probably don't want to use it for communications between London and New York though, because the direct cable between them is shorter.

We can update the routing table for the two sites:

San Francisco (SF):
CH-->NY
NY-->NY    (direct)
ED-->NY    (via NY to LN to ED)
LN-->NY
TK-->TK    (direct via Pacific cable)

Tokyo (TK):
SF-->SF    (direct via Pacific cable)
CH-->LN
NY-->LN
ED-->LN
LN-->LN    (direct)

Everything else can remain the same because all other sites will use the existing routes - until disaster strikes and the cable between London and New York gets damaged. We can connect to each router and update the routing table to reflect the change, sending all traffic via SF and TK rather than via NY and LN. Traffic between NY and LN goes the long way round but everything works again.

In practice though this just isn't going to work. Statically assigned routing is a huge overhead because, as sites are added, the number of routing table entries to maintain grows quadratically.

Think back to the full mesh network topology at the start - every node needed a path to every other node so the number of paths was n(n-1)/2 - the same calculation works here if you don't do anything clever with route summarisation.
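A quick bit of arithmetic shows how fast this gets out of hand. In this sketch `full_mesh_links` is the n(n-1)/2 formula from the text, while `static_route_entries` is my own assumption about the worst case: every router lists every other site explicitly, with no default routes or summarisation.

```python
def full_mesh_links(n):
    """Paths needed so that each of n nodes connects to every other node."""
    return n * (n - 1) // 2


def static_route_entries(n):
    """Total entries if each of n routers lists every other site explicitly."""
    return n * (n - 1)


for sites in (6, 50, 500):
    print(sites, full_mesh_links(sites), static_route_entries(sites))
```

Six sites are manageable by hand; five hundred are not.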

Dynamic routing

Building up a routing table dynamically relies on routers talking to each other. This is done using routing protocols, which are essentially ways for routers to exchange information about the networks they know about and, in many cases, the distance or cost of getting there over that path. One of the most popular protocols, OSPF (Open Shortest Path First), uses a route finding algorithm called Dijkstra's algorithm, which is the same algorithm used by some sat nav units to calculate the best route to somewhere taking into account distance, road congestion, and speed limits.

A better solution for Sat Navs is likely to be the A* search algorithm which takes into account heuristics to indicate roughly what direction the target is from the start point. If you are in London trying to find a route to Edinburgh you are probably going to be wasting your time looking at routes that start off going south. OSPF has no such heuristic to influence path selection so relies on cost alone.
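Here is a rough sketch of Dijkstra's algorithm running over our six cities, including the Pacific cable. The link costs in `LINKS` are numbers I've invented purely for illustration (OSPF would normally derive cost from link bandwidth), and `build_graph` and `shortest_paths` are hypothetical helper names, not part of any routing software.

```python
import heapq

# Link costs are invented for illustration only.
LINKS = {
    ("SF", "NY"): 40,
    ("CH", "NY"): 10,
    ("NY", "LN"): 55,
    ("LN", "ED"): 5,
    ("LN", "TK"): 95,
    ("SF", "TK"): 80,
}


def build_graph(links):
    """Turn the link list into a two-way adjacency map."""
    graph = {}
    for (a, b), cost in links.items():
        graph.setdefault(a, {})[b] = cost
        graph.setdefault(b, {})[a] = cost
    return graph


def shortest_paths(graph, source):
    """Return {node: (total_cost, path)} for every node reachable from source."""
    best = {}
    queue = [(0, source, [source])]
    while queue:
        cost, node, path = heapq.heappop(queue)
        if node in best:
            continue  # already settled with a lower or equal cost
        best[node] = (cost, path)
        for neighbour, link_cost in graph[node].items():
            if neighbour not in best:
                heapq.heappush(queue, (cost + link_cost, neighbour, path + [neighbour]))
    return best


graph = build_graph(LINKS)
cost, path = shortest_paths(graph, "CH")["TK"]
print(cost, path)
```

With these made-up costs Chicago reaches Tokyo via New York and San Francisco, because the total cost of that path is lower than going via London. Change the costs and the chosen path changes with them, which is exactly the lever a network admin pulls.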

Now we have got to the point where we can get data from one part of the world to the other across our IP network. We still need to figure out a bit more about what to send to make it useful but we don't operate the only network. The Internet is, by definition, a collection of networks that have joined together to form a larger network. Inter is a prefix that means between, whereas intra means within. An intranet is within your own network and internet means between separate networks. So how do we connect our networks and exchange routing information with other networks in a safe way? We want to have something on the border of our network that allows us to carefully exchange information about the networks we know about within our autonomous system and the networks they can see within their autonomous system. We might also want to exchange information about other autonomous systems that we are connected to that others can reach through us. The protocol for exchanging this information at the border of our network is Border Gateway Protocol, or BGP.

Peering with other networks

When networks grow beyond a single organisation, or autonomous system, we find we need to share routing information whilst maintaining control and independence. Each autonomous system needs to:

Control what routes they advertise to others
Choose which routes to accept from others
Make policy decisions about traffic flow
Maintain stability when other networks have problems

Each autonomous system for internet peering is assigned a unique number, the Autonomous System Number (ASN). BGP uses this to track the path a route has taken through different networks to ensure that there are no loops. It also uses the list of ASNs that a route has been through to get a crude estimate of the distance to the destination using the path length; in most cases a route that passes through fewer networks is considered better than one that passes through more. Unlike internal routing protocols, BGP is not designed to focus on finding the shortest path; it is designed to solve a different set of problems:

Policy Enforcement

BGP allows network owners to apply policies to routing decisions at their perimeter to control what routes they allow into their network. Note that BGP controls the routing information, the control plane, and not the data plane itself. Firewalls and other security technologies provide that layer of protection. As part of the agreement to peer with another network it is common for organisations to share their routing policy so that both peers know what to expect.

AS 65001's Policy:
- Advertise only our customer networks
- Accept only customer routes from AS 65002
- Prefer paid links over free peering

AS 65002's Policy:
- Advertise all routes to paying customers
- Accept all routes from AS 65001
- Filter out private networks

Route Aggregation

Where networks are made up of contiguous smaller subnets it may be possible to summarise those networks into a larger network and just advertise the route to that. This reduces the overall complexity of the routing table.

Instead of advertising:
192.168.0.0/24
192.168.1.0/24
192.168.2.0/24
192.168.3.0/24

BGP can summarise as:
192.168.0.0/22
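Python's standard `ipaddress` module can do this summarisation for us, which makes it a handy way to check the arithmetic. This is a small sketch assuming four contiguous /24 networks starting at 192.168.0.0; the variable names are mine.

```python
import ipaddress

subnets = [
    ipaddress.ip_network("192.168.0.0/24"),
    ipaddress.ip_network("192.168.1.0/24"),
    ipaddress.ip_network("192.168.2.0/24"),
    ipaddress.ip_network("192.168.3.0/24"),
]

# collapse_addresses merges adjacent networks into the fewest possible prefixes.
summary = list(ipaddress.collapse_addresses(subnets))
print(summary)  # [IPv4Network('192.168.0.0/22')]

# A /22 covers 1024 addresses, the same as four /24s of 256 each.
print(summary[0].num_addresses)  # 1024
```

Summarisation only works cleanly when the block starts on the right boundary: a /22 must begin where the third octet is a multiple of four, so the networks 1 to 4 would not collapse into a single prefix.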

Path Selection

BGP's genius is that it doesn't just share routes - it shares the path to reach those routes. This path information (called the AS_PATH) lets networks make informed decisions about which routes to trust and use. It's like not just knowing there's a road to a destination, but knowing exactly which countries you'll pass through to get there.

BGP routers establish TCP sessions (on port 179) with their peers and exchange routes through UPDATE messages:

Router A                Router B
   |                      |
   |---Open-------------->|    "Hello, I'm AS 65001"
   |<--Open---------------|    "Hello, I'm AS 65002"
   |                      |
   |---Update------------>|    "I can reach 192.168.0.0/22"
   |<--Update-------------|    "I can reach 10.0.0.0/8"
   |                      |
   |---Keepalive--------->|    "Still here!"
   |<--Keepalive----------|    "Still here too!"

BGP does not directly allow a network admin to make routing choices within another ASN but it does allow them to provide information which can influence that decision. Routing decisions within an ASN are entirely controlled by the network admins of that ASN and how they choose to use the routing information provided to them.

BGP Path Selection (in order):
Highest LOCAL_PREF
Shortest AS_PATH
Lowest ORIGIN type
Lowest MED
External over internal BGP
Lowest IGP metric to next hop
Lowest router ID (tie breaker)
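The ordered list above behaves like a sort key: compare the first attribute and only move on to the next when there is a tie. Here is a simplified sketch of that idea. The `Route` class and its field names are my own invention, and real BGP implementations add further steps and caveats (for example, MED is normally only compared between routes from the same neighbouring AS).

```python
from dataclasses import dataclass

# Lower rank is preferred: IGP beats EGP beats INCOMPLETE.
ORIGIN_RANK = {"IGP": 0, "EGP": 1, "INCOMPLETE": 2}


@dataclass
class Route:
    prefix: str
    local_pref: int
    as_path: tuple
    origin: str
    med: int
    is_ebgp: bool
    igp_metric: int
    router_id: str


def preference_key(route):
    """Build a tuple that sorts the most preferred route first."""
    return (
        -route.local_pref,                 # highest LOCAL_PREF wins
        len(route.as_path),                # shortest AS_PATH
        ORIGIN_RANK[route.origin],         # lowest ORIGIN type
        route.med,                         # lowest MED
        0 if route.is_ebgp else 1,         # external over internal BGP
        route.igp_metric,                  # lowest IGP metric to next hop
        tuple(int(part) for part in route.router_id.split(".")),  # tie breaker
    )


def best_route(candidates):
    return min(candidates, key=preference_key)


a = Route("10.0.0.0/8", 100, (65002, 65010, 65020), "IGP", 0, True, 10, "1.1.1.1")
b = Route("10.0.0.0/8", 100, (65003, 65020), "IGP", 0, True, 10, "2.2.2.2")
c = Route("10.0.0.0/8", 200, (65004, 65011, 65012, 65020), "INCOMPLETE", 50, False, 30, "3.3.3.3")

print(best_route([a, b]).as_path)     # the shorter AS_PATH wins
print(best_route([a, b, c]).as_path)  # the higher LOCAL_PREF trumps path length
```

Notice that route `c` wins despite having the longest path. LOCAL_PREF is set by the local network admin, which is how policy gets to override distance.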

BGP's deliberate slowness in converging is actually a feature, not a bug. When a route disappears and reappears frequently (called route flapping), BGP implements route dampening to maintain stability. This prevents unstable routes from causing cascading problems across the internet.

What to transport

We can now get a packet from our source, anywhere in the world, to our destination, also anywhere in the world. We are sending that data in discrete packets and to avoid congestion on the network we probably want to keep those packets small so we don't clog up a link and stop other packets flowing over it for too long. That's good for sending a single packet but what about if we want a longer connection with data flowing back and forth? What if we want to actually get a response or an acknowledgement that the packet was received? Here we need to look at the protocols within IP and what they can do for us.

The protocols you are most likely to already have heard of are TCP, UDP and perhaps ICMP. ICMP is what is used in the ping command which is a rather troublesome way to troubleshoot network connectivity.

UDP (User Datagram Protocol)

Once we can route packets across networks, we need a way to get information from one application to another. The User Datagram Protocol (UDP) provides one of the simplest solutions to this problem. Think of UDP like sending a postcard - you write your message, address it, and send it on its way. Once it's posted, you have no way of knowing if it arrived, and you won't get any confirmation of delivery. This might seem like a limitation, but in many cases, it's exactly what we need.

The word "datagram" in UDP's name gives us a clue about how it works - it's about sending independent messages (datagrams) that each carry their own addressing information. Each message stands alone, like a self-contained postcard, rather than being part of a larger conversation.

Looking inside a UDP message reveals a remarkably simple structure. The header contains just four fields, each 16 bits long: source port, destination port, length and checksum.

The source and destination ports tell us which applications should handle the message at each end. The length field tells us how big the entire package is, and a checksum provides basic error detection. That's all there is to it - UDP adds just enough information to get a packet of data from one application to another.
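The whole header is eight bytes, which is easy to see if we build one by hand. This is a minimal sketch using Python's `struct` module; `build_datagram` and `parse_datagram` are hypothetical helper names, and I've left the checksum as zero rather than calculating it (over IPv4 a zero checksum means "not computed").

```python
import struct

# Network byte order, four unsigned 16-bit fields: 8 bytes in total.
UDP_HEADER = struct.Struct("!HHHH")


def build_datagram(src_port, dst_port, payload, checksum=0):
    """Prefix the payload with a UDP header. The checksum is not calculated here."""
    length = UDP_HEADER.size + len(payload)  # length covers header and data
    return UDP_HEADER.pack(src_port, dst_port, length, checksum) + payload


def parse_datagram(datagram):
    src_port, dst_port, length, checksum = UDP_HEADER.unpack_from(datagram)
    return {
        "src_port": src_port,
        "dst_port": dst_port,
        "length": length,
        "checksum": checksum,
        "payload": datagram[UDP_HEADER.size:length],
    }


datagram = build_datagram(53000, 53, b"hello")
print(len(datagram), parse_datagram(datagram))
```

In real life the operating system builds this header for you; the point here is only to show how little there is to it.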

This simplicity makes UDP perfect for applications where timeliness matters more than perfect reliability. Consider a voice chat application: if a packet containing a fraction of a second of audio gets lost, it's better to skip that tiny bit of sound than to wait for it to be sent again. By the time the lost audio would arrive, the conversation would have moved on, and the delayed sound would be more disruptive than the brief silence from the lost packet.

Online gaming often uses UDP for similar reasons. When a game is sending constant updates about player positions, losing an occasional update isn't catastrophic - the game can make educated guesses about where things should be until the next update arrives. The important thing is keeping the game moving without introducing delays.

DNS, the internet's phone book that we'll come to shortly, also uses UDP for its queries. When you want to look up a website's IP address, adding complex reliability mechanisms would just slow things down. A simple query and response is much more efficient, and if something goes wrong, the application can simply try again.

The lack of connection state in UDP also makes it ideal for services that need to handle huge numbers of clients. A DNS server doesn't need to maintain information about every client that might ask it a question - it just answers queries as they arrive. This stateless nature allows UDP servers to be much more scalable than more complex approaches.

UDP's simplicity extends to error handling as well. While it does include a checksum to detect corrupted packets, it doesn't do anything about them - corrupted packets are simply discarded. There's no mechanism to send packets again if they're lost, no system to confirm receipt, no guarantee that packets will arrive in the same order they were sent. If an application needs any of these features, it must implement them itself.

The "fire and forget" nature of UDP can sometimes lead to interesting challenges. Without built-in flow control, it's possible for a fast sender to overwhelm a slow receiver, causing packets to be dropped. Applications might need to implement their own mechanisms to avoid overwhelming network links. These considerations make UDP programming more challenging in some ways, despite its simpler protocol.

This might seem like UDP is pushing complexity up to the application layer, and in a sense, it is. But this is actually one of UDP's strengths - it allows applications to implement exactly the level of reliability they need, no more and no less. A video streaming application might choose to resend certain critical packets but not others, based on their importance to the video quality.

Not every application can work with UDP's simple "best effort" delivery model. Many applications need their data to arrive complete and in order, which is where TCP comes in.

TCP (Transmission Control Protocol)

TCP is the workhorse of the internet, providing reliable, ordered delivery of data between applications. While UDP simply sends packets and hopes for the best, TCP creates a sophisticated conversation between sender and receiver. Think of it like the difference between dropping a letter in a postbox and having an important conversation over the phone - with TCP, both sides actively participate in ensuring the message gets through.

The magic begins with what we call the three-way handshake. Imagine you're making a phone call: first you dial (sending a SYN flag), the other person picks up and says "hello" (sending back a SYN-ACK), and you respond with your own "hello" (sending an ACK). This seemingly simple exchange actually establishes something quite sophisticated - both computers agree on sequence numbers they'll use to keep track of the conversation, much like you might number pages in a long letter to ensure they stay in order.

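Let's peek inside a TCP header to understand how this works. Every TCP segment (that's what we call the individual pieces of a TCP stream) carries a wealth of information in its header: source and destination ports, sequence and acknowledgment numbers, a data offset, a set of flag bits, a window size, a checksum and an urgent pointer.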

The sequence and acknowledgment numbers are particularly clever. The sequence number identifies each byte in the stream of data being sent, while the acknowledgment number tells the other side which byte is expected next. This system allows TCP to handle lost, duplicated, or out-of-order packets gracefully.

When data starts flowing, TCP doesn't just send everything at once. Instead, it uses a sophisticated flow control mechanism called the sliding window. The "Window" field in the header tells the sender how much data the receiver is willing to accept. This window slides forward as data is acknowledged, preventing any one side from overwhelming the other with too much data too quickly.

The flag bits in the TCP header each serve specific purposes. SYN (synchronise) and FIN (finish) manage connection establishment and teardown. ACK (acknowledge) confirms received data. PSH (push) suggests the receiver should pass this data to the application immediately rather than buffering it. URG (urgent) marks priority data, though it's rarely used today. RST (reset) abruptly terminates connections when something goes wrong.
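To show how the flags sit alongside the other header fields, here is a small sketch that unpacks the fixed 20-byte part of a TCP header. The function name `parse_tcp_header` and the sample values are made up for illustration, and the sketch ignores TCP options entirely.

```python
import struct

# Ports (2 x 16 bit), sequence and acknowledgment numbers (2 x 32 bit),
# data offset plus flags (16 bit), window, checksum, urgent pointer (3 x 16 bit).
TCP_FIXED_HEADER = struct.Struct("!HHIIHHHH")

FLAG_BITS = {"FIN": 0x01, "SYN": 0x02, "RST": 0x04, "PSH": 0x08, "ACK": 0x10, "URG": 0x20}


def parse_tcp_header(segment):
    """Decode the fixed part of a TCP header; options are not handled."""
    (src, dst, seq, ack, offset_flags,
     window, checksum, urgent) = TCP_FIXED_HEADER.unpack_from(segment)
    return {
        "src_port": src,
        "dst_port": dst,
        "seq": seq,
        "ack": ack,
        "header_len": (offset_flags >> 12) * 4,  # data offset counts 32-bit words
        "flags": [name for name, bit in FLAG_BITS.items() if offset_flags & bit],
        "window": window,
    }


# A hand-built SYN-ACK, the second step of the three-way handshake.
syn_ack = TCP_FIXED_HEADER.pack(80, 51000, 1000, 501, (5 << 12) | 0x12, 65535, 0, 0)
print(parse_tcp_header(syn_ack))
```

The SYN-ACK carries both the SYN and ACK bits in one segment, which is why the handshake takes three steps rather than four.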

TCP also includes a built-in capability to handle network congestion. If packets start getting lost (which we can detect through missing acknowledgments), TCP assumes the network is congested and slows down its transmission rate. It then gradually speeds up again as packets successfully get through. This self-regulating behaviour helps prevent network collapse under heavy load.
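The "slow down sharply, speed up gently" behaviour is often described as additive increase, multiplicative decrease. Here is a deliberately simplified sketch of that pattern. It is not how any real TCP stack is written (real implementations add slow start, fast recovery and more), and the `aimd` function and its parameters are my own.

```python
def aimd(loss_rounds, rounds, start=1.0):
    """Track a congestion window, in segments, over a number of round trips."""
    window = start
    history = []
    for round_number in range(rounds):
        if round_number in loss_rounds:
            window = max(1.0, window / 2)  # loss detected: halve the window
        else:
            window += 1.0                  # all acknowledged: grow by one segment
        history.append(window)
    return history


print(aimd(loss_rounds={5}, rounds=8))
# [2.0, 3.0, 4.0, 5.0, 6.0, 3.0, 4.0, 5.0]
```

Plot that history and you get the classic sawtooth shape of a TCP connection probing for spare capacity.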

The connection teardown is as carefully managed as the setup. When one side is finished sending data, it sends a FIN flag. The other side acknowledges this and may continue sending its own data. When it too is finished, it sends its own FIN, which gets acknowledged. This four-way handshake ensures both sides have finished their conversation before the connection is fully closed - like saying "goodbye" and waiting for the other person to say "goodbye" too before hanging up the phone.

All these mechanisms work together to provide what we call TCP's reliability guarantees: data arrives in order, without gaps, exactly once. This reliability comes at the cost of some overhead - those acknowledgments and window advertisements take up bandwidth, and waiting for acknowledgments adds latency. But for many applications, from web browsing to email to file transfer, this is a worthwhile trade-off for the assurance that all data will arrive correctly.

The sophistication of TCP becomes particularly apparent when things go wrong. If a packet is lost, TCP automatically retransmits it. If packets arrive out of order, TCP holds them until the gaps can be filled in. If the network becomes congested, TCP adjusts its sending rate to help alleviate the problem. All of this happens without the applications having to worry about it - they just see a reliable stream of data.

What makes TCP truly remarkable is how it provides this reliability over an unreliable network. The Internet Protocol (IP) makes no guarantees about packet delivery - packets can be lost, duplicated, or arrive out of order. TCP builds reliable communication on top of this uncertainty, much like building a dependable postal service using couriers who might occasionally lose packages or deliver them in the wrong order. The fact that this works so well is a testament to the elegant design of the protocol.

The phone book

By now we have used analogies including the postal service, text messaging and phone calls so it was only a matter of time before we brought the phone book into it. It may not be as common to people now but we used to have big books in our houses with all the phone numbers of all the houses in our local area. For anyone under 30 this might seem bonkers.

We've looked at IP addresses. They're just a 32-bit number expressed in dotted decimal notation for humans to read. So when you want to contact me at 74.125.133.27 you can do so now, right? People don't want to have to remember 32-bit numbers, so we need a more friendly name for computer systems around the world and we need to make it scale; the best way to make things scale is often to distribute them rather than store them centrally.
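If you want to see the 32-bit number hiding behind the dots, this short sketch converts the address both by hand and with Python's standard `ipaddress` module. The helper name `dotted_to_int` is mine.

```python
import ipaddress


def dotted_to_int(text):
    """Shift each octet into place: the first octet is the most significant byte."""
    value = 0
    for octet in text.split("."):
        value = (value << 8) | int(octet)
    return value


address = "74.125.133.27"
as_int = dotted_to_int(address)

print(as_int)
print(f"{as_int:032b}")  # the same number as 32 binary digits

# The standard library agrees with the manual conversion, in both directions.
print(int(ipaddress.IPv4Address(address)) == as_int)
print(str(ipaddress.IPv4Address(as_int)))
```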

DNS, or the Domain Name System, introduces a hierarchical naming system which can be distributed, such that individual records can be delegated down to the organisations and individuals who manage them using a system of distributed authority. If you take my website address, www.simonpainter.com, it has three clear parts: the host name www, which is the convention for hosts that make up the world wide web, and the domain simonpainter.com, which has a top level of com and a second level of simonpainter.

I want to stop and talk about the dot in the domain name, or more interestingly the fact that it doesn't exist in the actual DNS query. The space between each portion of the domain name is filled with a byte containing the length of the next portion.

Common packet capture output:
 00 11 22 33 44 55 66 77   01 02 03 04 05 06 07 08
 03 77 77 77 06 67 6F 6F   67 6C 65 03 63 6F 6D 00

Converting the DNS query above:
03          Length (3)
77 77 77    'www'
06          Length (6)
67 6F 6F    'goo'
67 6C 65    'gle'
03          Length (3)
63 6F 6D    'com'
00          End marker

You would express www.google.com as 03 77 77 77 06 67 6f 6f 67 6c 65 03 63 6f 6d 00, where 03, 06 and 03 are the lengths of www, google and com respectively, and the final 00 is a zero-length label that marks the end of the name.
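The encoding is simple enough to reproduce in a few lines. This sketch only handles plain ASCII names; the function name `encode_name` is my own, and the 63-byte limit on each label comes from the DNS specification.

```python
def encode_name(name):
    """Encode a domain name as length-prefixed labels ending in a zero byte."""
    encoded = bytearray()
    for label in name.rstrip(".").split("."):
        raw = label.encode("ascii")
        if not 0 < len(raw) <= 63:
            raise ValueError(f"label {label!r} must be 1 to 63 bytes long")
        encoded.append(len(raw))  # the length byte replaces the dot
        encoded += raw
    encoded.append(0)             # zero-length label marks the end
    return bytes(encoded)


print(encode_name("www.google.com").hex(" "))
# 03 77 77 77 06 67 6f 6f 67 6c 65 03 63 6f 6d 00
```

A nice side effect of the length prefix is that the parser never has to scan for a separator character; it always knows exactly how many bytes to read next.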

When I want to look up how to get to www.simonpainter.com I start by querying my local DNS server. It's another one of those options that can be configured manually or with DHCP, so I know my DNS server's IP address (or more likely I know more than one). I send a UDP packet asking it to resolve www.simonpainter.com to an IP address. My local DNS server probably doesn't know the answer because it's not the authority for that domain, so it does one of two things: it could forward all queries to a different DNS server, perhaps a central one managed by my internet service provider, or it could try recursively looking it up itself. If it forwards the query to the ISP DNS server, it's likely the ISP DNS server would do a recursive lookup. If someone else had recently requested that lookup from the ISP, it may be able to respond with a cached answer rather than look it up again.

A recursive lookup starts at the DNS root servers. They are globally distributed computers which all hold the same root zone, the starting point for a lookup. DNS servers just have to know where to find the root servers, so all recursive DNS servers hold a list of the root servers and their IP addresses. The root servers know the name servers that hold information for anything ending .com and will respond with that information. The records from the root zone look like this for .com:

com.            172800    IN    NS    a.gtld-servers.net.
com.            172800    IN    NS    b.gtld-servers.net.
com.            172800    IN    NS    c.gtld-servers.net.
com.            172800    IN    NS    d.gtld-servers.net.
com.            172800    IN    NS    e.gtld-servers.net.
com.            172800    IN    NS    f.gtld-servers.net.
com.            172800    IN    NS    g.gtld-servers.net.
com.            172800    IN    NS    h.gtld-servers.net.
com.            172800    IN    NS    i.gtld-servers.net.
com.            172800    IN    NS    j.gtld-servers.net.
com.            172800    IN    NS    k.gtld-servers.net.
com.            172800    IN    NS    l.gtld-servers.net.
com.            172800    IN    NS    m.gtld-servers.net.

From one of those name servers for .com the DNS server will find out the name servers for simonpainter.com, and that is where it can get an authoritative answer for www.simonpainter.com.

This only covers DNS A records (host records) and NS records (name server records) but there are several other record types which serve different purposes. A records resolve names to IPv4 addresses and AAAA records are used for IPv6, because 128-bit IPv6 addresses wouldn't fit in the 32 bits allocated for IPv4 addresses in A records. The name AAAA comes from the address being four times the size of the one in an A record.

A moment to talk about security

So far we've just been sending plain ASCII, or Unicode, text. When we establish communications between computers we now want to ensure there is some degree of encryption.

Transport Layer Security (TLS) evolved from Secure Sockets Layer (SSL), which Netscape developed in 1994. SSL went through versions 1.0 (never released), 2.0, and 3.0, but security flaws led to the development of TLS 1.0 in 1999. TLS provides encryption, authentication, and integrity for data transmitted between clients and servers, using a combination of asymmetric encryption for key exchange and symmetric encryption for data transfer.

The adult entertainment industry, facing challenges with online payment processing in the mid-1990s, played a crucial role in advancing secure online transactions. Traditional payment processors were hesitant to handle adult content transactions, leading adult websites to develop and fund their own payment solutions. This investment helped establish the infrastructure for secure online payments, pushing forward the adoption of SSL/TLS encryption for e-commerce. The same technologies later became standard across all industries for protecting sensitive data during online transactions.

Public Key Cryptography

Public key cryptography solves a fundamental problem: how do you share a secret key with someone when anyone could be listening? The solution uses pairs of mathematically linked keys - one public that can be freely shared, and one private that must be kept secret. Data encrypted with the public key can only be decrypted with the private key, and vice versa.

Example:
1. Alice generates key pair: public key Pa, private key Sa
2. Alice shares Pa publicly, keeps Sa secret
3. Bob wants to send message M to Alice:
   - Encrypts M with Pa: C = encrypt(M, Pa)
   - Sends C to Alice
4. Only Alice can decrypt C using Sa:
   - M = decrypt(C, Sa)

The mathematics behind public key cryptography relies on problems that are easy to do in one direction but computationally infeasible to reverse, like multiplying two large prime numbers versus factoring their product. This asymmetry provides the security foundation for modern encryption.
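The Alice and Bob exchange above can be played out with toy numbers. This is a textbook RSA sketch using tiny primes so the arithmetic is visible; the specific values are chosen for illustration, and nothing this small (or this unpadded) is remotely secure. `public_key` plays the role of Pa and `private_key` the role of Sa.

```python
# Alice picks two (absurdly small) primes and keeps them secret.
p, q = 61, 53
n = p * q                   # the modulus, shared in both keys
phi = (p - 1) * (q - 1)

e = 17                      # public exponent, must share no factors with phi
d = pow(e, -1, phi)         # private exponent: the modular inverse of e (Python 3.8+)

public_key = (e, n)         # Pa: published for anyone to use
private_key = (d, n)        # Sa: never leaves Alice


def encrypt(message, key):
    exponent, modulus = key
    return pow(message, exponent, modulus)


def decrypt(ciphertext, key):
    exponent, modulus = key
    return pow(ciphertext, exponent, modulus)


message = 65                              # M must be a number smaller than n
ciphertext = encrypt(message, public_key)  # Bob: C = encrypt(M, Pa)
recovered = decrypt(ciphertext, private_key)  # Alice: M = decrypt(C, Sa)

print(ciphertext, recovered)
```

An eavesdropper sees `n` and `e`, but to work out `d` they would need to factor `n` back into `p` and `q`. With two-digit primes that takes moments; with primes hundreds of digits long it becomes computationally infeasible.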

How TLS Works

TLS operates through a handshake process that establishes a secure connection. First, the client and server agree on which version of TLS and which encryption algorithms they'll use. The server presents its digital certificate, which contains its public key. Using this public key, the client and server securely exchange a session key, which they then use for symmetric encryption of the actual data transfer.

TLS Handshake Process:
Client Hello: Supported TLS versions and cipher suites
Server Hello: Chosen version and cipher suite
Server Certificate: Public key and identity verification
Key Exchange: Generate shared session key
Handshake Complete: Begin encrypted communication

The beauty of this system is that it combines the security of asymmetric encryption for the initial handshake with the speed of symmetric encryption for the ongoing data transfer. Each new session gets a unique session key, ensuring that even if one session is compromised, others remain secure.

And finally the application

We know how to resolve a name to an IP address anywhere in the world, we know how to route traffic to it across many different networks and we understand right the way down to the pulses of light or electricity through a cable how we get data across networks. Once it arrives at a computer we know that if the destination MAC address is right the frame will be received and the packet inside inspected. We know that it might be a UDP packet or it might be part of a TCP session but whatever it is there will be an application waiting there listening for that data. But what if there is more than one networked application running on that computer? How do we ensure that data for an application is passed from the network interface to the correct application?

A socket is the construct that serves as the endpoint for a networked application to send and receive data. In the context of IP a socket address is the triad of transport protocol, IP address, and port number. This recognises that systems may have more than one IP address and more than one interface. An example of a socket for a web server might be tcp/80 on address 192.168.0.50. Wildcards are sometimes used in configuration where an application is bound to any IP address associated with a system. Note that TCP and UDP would listen on different sockets; however, some applications, such as DNS, use both and would therefore have sockets for both.

Applications have commonly associated ports and the TCP or UDP header will have the application port within it. When the packet is received the port number, along with the protocol and the IP address, is used to pass the payload of the packet to the correct application. Those applications then parse the data according to their own application protocols. There are a number of common TCP and UDP port numbers that are by convention associated with common applications; however, you can configure most applications to listen on any port, and so long as the sender knows to use the non-standard port everything will still work.
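You can see the protocol, address and port triad in action with Python's standard `socket` module. This is a minimal sketch that assumes the loopback interface is available; it binds a UDP socket to 127.0.0.1, lets the operating system pick a free port, and sends itself a message. The variable names are mine.

```python
import socket

# The receiving application: protocol UDP, address 127.0.0.1, port chosen by the OS.
receiver = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
receiver.bind(("127.0.0.1", 0))   # port 0 means "any free port"
receiver.settimeout(2.0)
address = receiver.getsockname()  # (ip, port) actually assigned

# The sending application only needs to know that address and port.
sender = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
sender.sendto(b"hello", address)

payload, peer = receiver.recvfrom(1024)
print(f"udp/{address[1]} on {address[0]} received {payload!r} from port {peer[1]}")

sender.close()
receiver.close()
```

The sender also gets a port of its own, assigned automatically, which is how the receiver would know where to send a reply.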

Some of the well-known ports (1-1023):
Port    Proto   Service         Purpose
20      TCP     FTP-data        File transfer data channel
21      TCP     FTP-control     File transfer control channel
22      TCP     SSH             Secure remote access and file transfer
23      TCP     Telnet          Legacy remote terminal (unencrypted)
25      TCP     SMTP            Email submission
53      TCP/UDP DNS             Name resolution
67      UDP     DHCP-Server     Network configuration (server side)
68      UDP     DHCP-Client     Network configuration (client side)
69      UDP     TFTP            Trivial file transfer (network boot)
80      TCP     HTTP            Web traffic
123     UDP     NTP             Time synchronisation
443     TCP     HTTPS           Secure web traffic
445     TCP     SMB             File/printer sharing
514     UDP     Syslog          System logging

Sir Tim and the World Wide Web

In 1989, Tim Berners-Lee proposed a distributed knowledge sharing system that was developed at CERN and named the World Wide Web. It comprised an application protocol called Hypertext Transfer Protocol (HTTP) and a markup language called Hypertext Markup Language (HTML).

The Birth at CERN (1989-1991)

Tim Berners-Lee, a British physicist and computer scientist working at CERN (the European Organization for Nuclear Research), identified a critical problem: scientists struggled to share information efficiently across different computer systems. His solution was revolutionary yet elegantly simple.

In March 1989, Berners-Lee submitted a proposal titled "Information Management: A Proposal" to his supervisors at CERN. By December 1990, he had implemented the first web server (CERN HTTPd), browser (WorldWideWeb, later renamed Nexus), and the initial versions of HTML and HTTP.

HTML: The Evolution of a Markup Language

HTML began as a simple language derived from SGML (Standard Generalized Markup Language). The first version was informal, containing just 18 elements focused on basic document structure and hyperlinks.

HTML 1.0 (1991): Never formalised but established the concept of linked documents
HTML 2.0 (1995): The first official specification, standardising core elements
HTML 3.2 (1997): Introduced tables and applets
HTML 4.0 (1997-1999): Added stylesheets, scripts, and accessibility features
XHTML (2000): Reformulated HTML as an XML application for stricter validation
HTML5 (2014): A major overhaul introducing semantic elements, multimedia support, and APIs for complex web applications

HTTP: From Simple to Sophisticated

HTTP evolved alongside HTML to facilitate the transfer of web resources:

HTTP/0.9 (1991): An originally unversioned, extremely simple protocol allowing only GET requests
HTTP/1.0 (1996): Added headers, status codes, and support for different document types
HTTP/1.1 (1997): Introduced persistent connections, chunked transfers, and host headers
HTTP/2 (2015): Introduced multiplexing, server push, and header compression for improved performance
HTTP/3 (2022): Implemented over QUIC protocol to reduce latency and improve reliability

From Academic Project to Global Revolution

What began as a solution for physics researchers quickly transcended CERN's walls. In 1993, CERN announced that the World Wide Web would be free for everyone to use and develop. This decision catalysed exponential growth.

The creation of the W3C (World Wide Web Consortium) in 1994, with Berners-Lee as its director, established a governing body to guide web standards development. This ensured that the web would remain open, accessible, and universally compatible.

By the mid-1990s, the Web had transformed from an academic curiosity to a global phenomenon, forever changing how humanity communicates, conducts business, and shares knowledge.

People don't need to study preposterous acronyms

Now that you have reached the end of this, you also know a bit more about the OSI model and what people mean when they talk about 'layer 2' or 'layer 3' networks. Networking is conceptually organised into discrete layers which have standardised interfaces between them. Those layers have irritatingly unintuitive names:

Layer 1 - Physical
Layer 2 - Datalink
Layer 3 - Network
Layer 4 - Transport
Layer 5 - Session
Layer 6 - Presentation
Layer 7 - Application

There are lots of ways to remember the order, PDNTSPA: 'Please Do Not Throw Sausage Pizza Away' is one particularly bad example. What is important is to understand that the bits where we discussed the cable and underlying physical network connections were all about layer 1, while the hub and switch topology and data frames between MAC addresses were all at layer 2. Routing and routing protocols were layer 3, while layer 4 is made up of the transport protocols like TCP and UDP. The internet protocol suite blurs the model a bit at the top because layers 5, 6 and 7 are all associated with application traffic, but we now know quite a lot about that too. You should now be able to visualise the path of data from when you put this web page address into your browser to when the page loaded up for you to read. I hope you've enjoyed the process.
