What This Recipe Does
- Navigate to a webpage with Kernel browser
- Capture a screenshot (visual content)
- Extract DOM/text content (semantic content)
- Send both to an LLM for analysis
- Get structured summary or insights
Use Cases
- Monitor competitor product pages for changes
- Summarize articles or documentation
- Extract structured data from unstructured pages
- Analyze landing page messaging
- QA content quality across deployments
Complete Code
Environment Variables
Expected Output
Variations
Use OpenAI GPT-4o Instead
Extract Specific Information
Compare Two Pages
Performance Optimization
Block Unnecessary Resources
Use Persistent Session for Batch Analysis
Common Issues
Screenshot Too Large for LLM
Most LLMs have image size limits (e.g., 20MB for Claude). Reduce screenshot size:LLM Missing Important Content
If content is below the fold or in tabs/dropdowns:Rate Limits
For batch analysis, add rate limiting:Cost Estimation
Per page:- Kernel browser: ~0.05/min)
- Claude API: ~$0.05 (1k tokens output + image)
- Total: ~$0.06/page
Related Recipes
- Block Ads/Trackers - Speed up page loads
- Parallel Browsers - Analyze multiple pages faster
- Auth & Cookies - Analyze logged-in content