Files

nasir@endelospay.com d97cad1736 first commit

2025-08-12 02:54:17 +05:00

6.3 KiB

Raw Blame History

Browser Automation Debugging Guide

This guide explains how to use the enhanced debugging features to troubleshoot browser automation issues in the LiveKit Chrome Agent.

Overview

The enhanced debugging system provides comprehensive logging and troubleshooting tools to help identify and resolve issues when browser actions (like "click login button") are not being executed despite selectors being found correctly.

Enhanced Features

1. Enhanced Selector Logging

The system now provides detailed logging for every step of selector discovery and execution:

🔍 SELECTOR SEARCH: Shows what element is being searched for
📊 Found Elements: Lists all interactive elements found on the page
🎯 Matching Elements: Shows which elements match the search criteria
🚀 EXECUTING CLICK: Indicates when an action is being attempted
✅ SUCCESS/❌ FAILURE: Clear indication of action results

2. Browser Connection Validation

Use validate_browser_connection() to check:

MCP server connectivity
Browser responsiveness
Page accessibility
Current URL and page title

3. Step-by-Step Command Debugging

Use debug_voice_command() to analyze:

How commands are parsed
Which selectors are generated
Why actions succeed or fail
Detailed execution flow

Using the Debugging Tools

In LiveKit Agent

When connected to the LiveKit agent, you can use these voice commands:

"debug voice command 'click login button'"
"validate browser connection"
"test selectors 'button.login, #login-btn, .signin'"
"capture browser state"
"get debug summary"

Standalone Testing

Run the test scripts to diagnose issues:

# Test enhanced logging features
python test_enhanced_logging.py

# Test specific login button scenario
python test_login_button_click.py

# Run comprehensive diagnostics
python debug_browser_actions.py

Common Issues and Solutions

Issue 1: "Selectors found but action not executed"

Symptoms:

Logs show selectors are discovered
No actual click happens in browser
No error messages

Debugging Steps:

Run validate_browser_connection() to check connectivity
Use debug_voice_command() to see execution details
Check MCP server logs for errors
Verify browser extension is active

Solution:

Ensure MCP server is properly connected to browser
Check browser console for JavaScript errors
Restart browser extension if needed

Issue 2: "No matching elements found"

Symptoms:

Logs show "No elements matched description"
Interactive elements are found but don't match

Debugging Steps:

Use capture_browser_state() to see page state
Use test_selectors() with common patterns
Check if page has finished loading

Solution:

Try more specific or alternative descriptions
Wait for page to fully load
Use CSS selectors directly if needed

Issue 3: "Browser not responsive"

Symptoms:

Connection validation fails
No response from browser

Debugging Steps:

Check if browser is running
Verify MCP server is running on correct port
Check browser extension status

Solution:

Restart browser and MCP server
Reinstall browser extension
Check firewall/network settings

Enhanced Logging Output

The enhanced logging provides detailed information at each step:

🔍 SELECTOR SEARCH: Looking for clickable element matching 'login button'
📋 Step 1: Getting interactive elements from page
📊 Found 15 interactive elements on page
🔍 Element 0: {"tag": "button", "text": "Sign In", "attributes": {"class": "btn-primary"}}
🔍 Element 1: {"tag": "a", "text": "Login", "attributes": {"href": "/login"}}
✅ Found 2 matching elements:
   🎯 Match 0: selector='button.btn-primary', reason='text_content=sign in'
   🎯 Match 1: selector='a[href="/login"]', reason='text_content=login'
🚀 EXECUTING CLICK: Using selector 'button.btn-primary' (reason: text_content=sign in)
✅ CLICK SUCCESS: Clicked on 'login button' using selector: button.btn-primary

Debug Tools Reference

SelectorDebugger Methods

debug_voice_command(command): Debug a voice command end-to-end
test_common_selectors(selector_list): Test multiple selectors
get_debug_summary(): Get summary of all debug sessions
export_debug_log(filename): Export debug history to file

BrowserStateMonitor Methods

capture_state(): Capture current browser state
detect_issues(state): Analyze state for potential issues

MCPChromeClient Enhanced Methods

validate_browser_connection(): Check browser connectivity
_smart_click_mcp(): Enhanced click with detailed logging
execute_voice_command(): Enhanced voice command processing

Best Practices

Always validate connection first when troubleshooting
Use debug_voice_command for step-by-step analysis
Check browser state if actions aren't working
Test selectors individually to find working patterns
Export debug logs for detailed analysis
Monitor logs in real-time during testing

Log Files

The system creates several log files for analysis:

enhanced_logging_test.log: Main test output
login_button_test.log: Specific login button tests
browser_debug.log: Browser diagnostics
debug_log_YYYYMMDD_HHMMSS.json: Exported debug sessions

Troubleshooting Workflow

Validate Connection

validation = await client.validate_browser_connection()

Debug Command

debug_result = await debugger.debug_voice_command("click login button")

Capture State

state = await monitor.capture_state()
issues = monitor.detect_issues(state)

Test Selectors

results = await debugger.test_common_selectors(["button.login", "#login-btn"])

Analyze and Fix
- Review debug output
- Identify failure points
- Apply appropriate solutions

Getting Help

If issues persist after following this guide:

Export debug logs using export_debug_log()
Check browser console for JavaScript errors
Verify MCP server configuration
Test with simple selectors first
Review the enhanced logging output for clues

The enhanced debugging system provides comprehensive visibility into the browser automation process, making it much easier to identify and resolve issues with selector discovery and action execution.

6.3 KiB Raw Blame History

Browser Automation Debugging Guide

Overview

Enhanced Features

1. Enhanced Selector Logging

2. Browser Connection Validation

3. Step-by-Step Command Debugging

Using the Debugging Tools

In LiveKit Agent

Standalone Testing

Common Issues and Solutions

Issue 1: "Selectors found but action not executed"

Issue 2: "No matching elements found"

Issue 3: "Browser not responsive"

Enhanced Logging Output

Debug Tools Reference

SelectorDebugger Methods

BrowserStateMonitor Methods

MCPChromeClient Enhanced Methods

Best Practices

Log Files

Troubleshooting Workflow

Getting Help

6.3 KiB

Raw Blame History