Screenshot + DOM + LLM Summary

Combine screenshots, DOM extraction, and LLM analysis to understand webpages at scale. Useful for content monitoring, competitor analysis, and automated research.

What This Recipe Does

Navigate to a webpage with Kernel browser
Capture a screenshot (visual content)
Extract DOM/text content (semantic content)
Send both to an LLM for analysis
Get structured summary or insights

Use Cases

Monitor competitor product pages for changes
Summarize articles or documentation
Extract structured data from unstructured pages
Analyze landing page messaging
QA content quality across deployments

Complete Code

import { chromium } from 'playwright-core';
import { Kernel } from '@onkernel/sdk';
import Anthropic from '@anthropic-ai/sdk';

export async function analyzePage(url: string) {
  // Create Kernel browser
  const kernel = new Kernel({ apiKey: process.env.KERNEL_API_KEY! });
  const kb = await kernel.browsers.create({ headless: true });
  
  const browser = await chromium.connectOverCDP({
    wsEndpoint: kb.cdp_ws_url
  });
  
  const page = browser.contexts()[0].pages()[0];
  
  // Navigate and wait for content
  await page.goto(url, { waitUntil: 'networkidle' });
  
  // Capture screenshot
  const screenshot = await page.screenshot({
    type: 'png',
    fullPage: true
  });
  
  // Extract text content
  const textContent = await page.evaluate(() => {
    // Remove scripts, styles
    const clone = document.body.cloneNode(true) as HTMLElement;
    clone.querySelectorAll('script, style, nav, footer').forEach(el => el.remove());
    return clone.innerText;
  });
  
  // Get title and meta
  const title = await page.title();
  const description = await page.$eval(
    'meta[name="description"]',
    el => el.getAttribute('content')
  ).catch(() => null);
  
  await browser.close();
  await kernel.browsers.deleteByID(kb.session_id);
  
  // Analyze with Claude
  const anthropic = new Anthropic({
    apiKey: process.env.ANTHROPIC_API_KEY!
  });
  
  const response = await anthropic.messages.create({
    model: 'claude-sonnet-4-20250514',
    max_tokens: 1024,
    messages: [{
      role: 'user',
      content: [
        {
          type: 'image',
          source: {
            type: 'base64',
            media_type: 'image/png',
            data: screenshot.toString('base64')
          }
        },
        {
          type: 'text',
          text: `Analyze this webpage and provide:
1. Main topic/purpose
2. Key messages or value propositions
3. Call-to-action (if any)
4. Target audience
5. Overall tone/style

Context:
Title: ${title}
Description: ${description || 'N/A'}
Text content (first 2000 chars): ${textContent.slice(0, 2000)}...`
        }
      ]
    }]
  });
  
  return {
    url,
    title,
    description,
    screenshot: screenshot.toString('base64'),
    analysis: response.content[0].type === 'text' 
      ? response.content[0].text 
      : null
  };
}

// Usage in Next.js API route
export default async function handler(req, res) {
  const { url } = req.body;
  const result = await analyzePage(url);
  res.json(result);
}

Environment Variables

KERNEL_API_KEY=your_kernel_api_key
ANTHROPIC_API_KEY=your_anthropic_api_key
# or OPENAI_API_KEY for GPT-4o

Expected Output

{
  "url": "https://example.com",
  "title": "Example Domain",
  "description": "Example meta description",
  "screenshot": "base64_encoded_image...",
  "analysis": "This webpage is a simple example domain...\n\n1. Main topic: Domain placeholder\n2. Key messages: Demonstrates a basic webpage\n3. CTA: Links to 'More information'\n4. Target audience: Web developers, domain researchers\n5. Tone: Neutral, informative"
}

Variations

Use OpenAI GPT-4o Instead

import OpenAI from 'openai';

const openai = new OpenAI({
  apiKey: process.env.OPENAI_API_KEY!
});

const response = await openai.chat.completions.create({
  model: 'gpt-4o',
  messages: [{
    role: 'user',
    content: [
      {
        type: 'image_url',
        image_url: {
          url: `data:image/png;base64,${screenshot.toString('base64')}`
        }
      },
      {
        type: 'text',
        text: 'Analyze this webpage...'
      }
    ]
  }]
});

Extract Specific Information

const response = await anthropic.messages.create({
  model: 'claude-sonnet-4-20250514',
  max_tokens: 1024,
  messages: [{
    role: 'user',
    content: [
      { type: 'image', source: { ... } },
      {
        type: 'text',
        text: `Extract pricing information from this page as JSON:
{
  "plans": [
    {"name": "...", "price": "...", "features": [...]}
  ]
}`
      }
    ]
  }]
});

Compare Two Pages

// Capture both pages
const [page1Data, page2Data] = await Promise.all([
  analyzePage('https://example.com/old'),
  analyzePage('https://example.com/new')
]);

// Compare with LLM
const response = await anthropic.messages.create({
  model: 'claude-sonnet-4-20250514',
  messages: [{
    role: 'user',
    content: [
      { type: 'image', source: { type: 'base64', media_type: 'image/png', data: page1Data.screenshot } },
      { type: 'image', source: { type: 'base64', media_type: 'image/png', data: page2Data.screenshot } },
      { type: 'text', text: 'Compare these two pages. What changed?' }
    ]
  }]
});

Performance Optimization

Block Unnecessary Resources

// Before page.goto
await page.route('**/*', route => {
  const type = route.request().resourceType();
  if (['image', 'font', 'stylesheet'].includes(type)) {
    return route.abort();
  }
  return route.continue();
});

// Page loads faster, smaller screenshot

Use Persistent Session for Batch Analysis

const kb = await kernel.browsers.create({
  persistent: true,
  persistent_id: 'analyzer-session',
  headless: true
});

// Reuse browser for multiple pages
for (const url of urls) {
  const page = browser.contexts()[0].pages()[0];
  await page.goto(url);
  // ... analyze ...
  await page.goto('about:blank'); // Clear page
}

Common Issues

Screenshot Too Large for LLM

Most LLMs have image size limits (e.g., 20MB for Claude). Reduce screenshot size:

const screenshot = await page.screenshot({
  type: 'jpeg', // JPEG compresses better than PNG
  quality: 80,
  fullPage: false // Just viewport, not full page
});

LLM Missing Important Content

If content is below the fold or in tabs/dropdowns:

// Expand all sections
await page.evaluate(() => {
  document.querySelectorAll('details').forEach(el => el.setAttribute('open', ''));
  document.querySelectorAll('[aria-expanded="false"]').forEach(el => el.click());
});

// Scroll to load lazy content
await page.evaluate(() => window.scrollTo(0, document.body.scrollHeight));
await page.waitForTimeout(1000);

// Then capture screenshot

Rate Limits

For batch analysis, add rate limiting:

import pLimit from 'p-limit';

const limit = pLimit(5); // Max 5 concurrent analyses

const results = await Promise.all(
  urls.map(url => limit(() => analyzePage(url)))
);

Cost Estimation

Per page:

Kernel browser: ~ $0.01 (2s @$ 0.05/min)
Claude API: ~$0.05 (1k tokens output + image)
Total: ~$0.06/page

1,000 pages: ~$60

Block Ads/Trackers - Speed up page loads
Parallel Browsers - Analyze multiple pages faster
Auth & Cookies - Analyze logged-in content

Support

Questions? Join our Discord to discuss AI + browser automation patterns.

Data Extraction

Authentication & Files

Scaling & Testing

Screenshot + DOM + LLM Summary

What This Recipe Does

Use Cases

Complete Code

Environment Variables

Expected Output

Variations

Use OpenAI GPT-4o Instead

Extract Specific Information

Compare Two Pages

Performance Optimization

Block Unnecessary Resources

Use Persistent Session for Batch Analysis

Common Issues

Screenshot Too Large for LLM

LLM Missing Important Content

Rate Limits

Cost Estimation

Support

Data Extraction

Authentication & Files

Scaling & Testing

​What This Recipe Does

​Use Cases

​Complete Code

​Environment Variables

​Expected Output

​Variations

​Use OpenAI GPT-4o Instead

​Extract Specific Information

​Compare Two Pages

​Performance Optimization

​Block Unnecessary Resources

​Use Persistent Session for Batch Analysis

​Common Issues

​Screenshot Too Large for LLM

​LLM Missing Important Content

​Rate Limits

​Cost Estimation

​Related Recipes

​Related Features

​Support

What This Recipe Does

Use Cases

Complete Code

Environment Variables

Expected Output

Variations

Use OpenAI GPT-4o Instead

Extract Specific Information

Compare Two Pages

Performance Optimization

Block Unnecessary Resources

Use Persistent Session for Batch Analysis

Common Issues

Screenshot Too Large for LLM

LLM Missing Important Content

Rate Limits

Cost Estimation

Related Recipes

Related Features

Support