January 2, 2025 8 min to read

How Web Browsers Work

Understanding the internal working mechanism of web browsers, from DNS lookup to page rendering.

Overview

Let’s explore how web browsers work internally, from the moment a user enters a URL until the page is displayed.

Browser Components

Modern web browsers consist of several main components:

User Interface - Address bar, back/forward buttons, bookmarks, etc.
Browser Engine - Marshals actions between the UI and the rendering engine
Rendering Engine - Responsible for displaying requested content (parses HTML and CSS)
Networking - Handles HTTP requests
JavaScript Engine - Parses and executes JavaScript code (e.g., V8 in Chrome)
UI Backend - Used for drawing basic widgets like combo boxes and windows
Data Storage - Persistence layer (cookies, localStorage, IndexedDB, etc.)

Summary

User accesses website through browser (www.a.com)
Browser identifies server’s IP address through DNS
Browser and server perform 3-Way Handshake
Browser sends HTTP Request to server
Server sends HTTP Response to browser
Browser parses HTML to create DOM Tree
Upon encountering Style tags, pauses DOM creation to parse CSS and create CSSOM Tree
When encountering script tags, passes control to JavaScript engine to parse and create AST
Creates Render Tree by combining DOM + CSSOM
This process is called Construction
Rendering engine performs Layout on Render Tree nodes
UI backend draws UI by traversing Render Tree nodes (Painting)
Finally, composes nodes in Render Tree in order (Composition)
This process is called Operation
Displays final result to web user

Web Browser Working Process Flow Chart

graph TD; A[User accesses website www.a.com] --> B[DNS Lookup: Resolve IP address]; B --> C[3-Way Handshake SYN → SYN/ACK → ACK]; C --> D[Send HTTP Request to Server]; D --> E[Receive HTTP Response]; E --> F[Parse HTML → Create DOM Tree]; F --> G{Style tag detected?}; G -- Yes --> H[Parse CSS → Create CSSOM Tree]; H --> F; G -- No --> I{Script tag detected?}; I -- Yes --> J[Parse JavaScript → Create AST]; J --> F; I -- No --> K[Merge DOM + CSSOM → Render Tree]; K --> L[Layout: Position Elements]; L --> M[Painting: Render UI]; M --> N[Composition: Organize Layers z-index]; N --> O[👀 Display Rendered Page to User]; %% Additional Explanation E -.-> P[⚡ Partial Rendering for Faster Display]; P --> F;

🔍 Detailed Process

Construction Phase

STEP 1: Browser - DNS
- User enters website URL (www.a.com).
- Browser checks its cache for DNS records.
- If not found, browser queries DNS resolver cache.
- If still not found, DNS server performs recursive query to find IP.
- DNS returns IP address (e.g., 1.1.1.1).
STEP 2: Browser - Server
- Browser connects to server with IP address using random sequence number.
- Performs 3-Way Handshake (SYN → SYN/ACK → ACK).
- For HTTPS, TLS handshake occurs (cipher suites, certificate validation).
- Browser sends HTTP Request with headers (User-Agent, Accept, etc.).
- Server processes request and prepares response.
- Server responds with HTTP Response (status code, headers, content).
STEP 3: Browser - Parsing
- Browser parses received data according to W3C specifications.
- Rendering engine creates DOM Tree from HTML (document object model).
- When encountering Style tags:
- Pauses DOM creation.
- Parses CSS to create CSSOM Tree (CSS object model).
- Prioritizes render-critical CSS.
- Resolves styles with specificity rules.
- Resumes DOM creation.
- When encountering Script tags:
- Pauses parsing (unless async/defer attributes).
- Passes control to JS Engine.
- Creates AST (Abstract Syntax Tree).
- Compiles JavaScript to bytecode.
- Executes JavaScript code which may modify DOM/CSSOM.
- Creates Render Tree by combining DOM + CSSOM.
- Render Tree only includes visible elements (excludes display:none).

Operation Phase

STEP 1: Layout
- Rendering engine calculates exact position and size of each element.
- Computes the geometry of all elements on the page (width, height, position).
- Determines how elements affect each other (ex: parent element dimensions).
- Handles responsive layouts with media queries.
- This process was historically called "Reflow" in some browsers.
STEP 2: Painting
- UI Backend converts layout information into actual pixels on screen.
- Draws every visual part of the elements (text, colors, borders, shadows, etc.).
- Creates multiple layers when necessary for efficient updates.
- Uses GPU acceleration for certain CSS properties when available.
STEP 3: Composition
- Combines the painted layers into final screens.
- Arranges node layers in order (based on z-index).
- Lower z-index elements first, followed by higher ones.
- Handles transparency and blending between layers.
- Most efficient for animations and scrolling (avoids repainting).

Critical Rendering Path

The Critical Rendering Path is the sequence of steps browsers take to convert HTML, CSS, and JavaScript into actual pixels on the screen:

HTML Processing → DOM: Parse HTML to create the Document Object Model
CSS Processing → CSSOM: Parse CSS to create the CSS Object Model
JavaScript Execution: Execute JavaScript that might modify DOM and CSSOM
Render Tree Construction: Combine DOM and CSSOM into a render tree
Layout: Calculate the exact position and size of each element
Paint: Fill in pixels for all visible content
Composite: Draw the layers in the correct order

Optimizing the Critical Rendering Path:

Minimize number of critical resources (HTML, CSS, JS needed for initial render)
Minimize critical path length by optimizing the order resources are loaded
Minimize number of critical bytes by compressing and optimizing resources

Performance Optimization Techniques

Resource Loading Optimization:
- Use async and defer attributes for non-critical JavaScript
- Load critical CSS inline and non-critical CSS asynchronously
- Implement resource hints: preload, prefetch, preconnect
- Use HTTP/2 for parallel loading of resources
Rendering Optimization:
- Avoid layout thrashing (multiple forced reflows)
- Use CSS will-change property for elements that will animate
- Use hardware-accelerated CSS properties (transform, opacity) for animations
- Implement code-splitting to reduce initial JavaScript load
Measurement Tools:
- Lighthouse for performance auditing
- Chrome DevTools Performance panel
- WebPageTest for detailed waterfall analysis
- Core Web Vitals metrics (LCP, FID, CLS)

Browser Differences

Different browsers use different rendering engines and JavaScript engines:

Browser	Rendering Engine	JavaScript Engine
Chrome	Blink	V8
Firefox	Gecko	SpiderMonkey
Safari	WebKit	JavaScriptCore (Nitro)
Edge (modern)	Blink	V8
Internet Explorer	Trident	Chakra

Key Differences:

Feature support (check caniuse.com for compatibility)
Performance characteristics
Developer tools capabilities
Implementation of standards
Security model and sandboxing

Additional Notes

The parsing, layout, and UI drawing processes don’t wait for all data to be received from the server. For faster user experience:

Browser starts displaying content as soon as partial data is received
Continues this process as more data arrives
This explains why web pages load progressively rather than all at once

Real-World Example: Loading a Modern Web Page

When loading a typical website (e.g., an e-commerce site):

Browser resolves DNS and establishes HTTPS connection
Receives initial HTML (first contentful paint may occur)
Requests CSS files referenced in HTML
Requests JavaScript files (async/defer determines when they execute)
Requests web fonts
JavaScript might fetch additional data via AJAX/fetch API
Single Page Applications (SPAs) perform client-side rendering after initial load
Progressive Web Apps (PWAs) might use service workers to cache resources
Third-party scripts (analytics, ads) might load additional resources
Lazy-loading might defer images and other content until scrolled into view

🔑 Key Points

CSS is a render-blocking resource, not a parsing-blocking resource
JavaScript execution blocks parsing (unless async/defer is used)
Progressive rendering improves perceived performance
DOM and CSSOM construction must complete before render tree creation
Layout and painting are computationally expensive operations
Compositing allows for efficient animations and scrolling

somaz v3.1.2

How Web Browsers Work

Overview

Browser Components

Summary

Web Browser Working Process Flow Chart

🔍 Detailed Process

Critical Rendering Path

Performance Optimization Techniques

Browser Differences

Additional Notes

Real-World Example: Loading a Modern Web Page

🔑 Key Points

Reference

Understanding DNS - How Domain Name System Works

Somaz

Comments

How Web Browsers Work

Overview

Browser Components

Summary

Web Browser Working Process Flow Chart

🔍 Detailed Process

Critical Rendering Path

Performance Optimization Techniques

Browser Differences

Additional Notes

Real-World Example: Loading a Modern Web Page

🔑 Key Points

Reference

Understanding DNS - How Domain Name System Works

Share

Somaz

Comments