Files
broswer-automation/docs/CHANGELOG.md
nasir@endelospay.com d97cad1736 first commit
2025-08-12 02:54:17 +05:00

5.5 KiB

Changelog

All notable changes to this project will be documented in this file.

The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.

[v0.0.5]

Improved

  • Image Compression: Compress base64 images when using screenshot tool
  • Interactive Elements Detection Optimization: Enhanced interactive elements detection tool with expanded search scope, now supports finding interactive div elements

[v0.0.4]

Added

  • STDIO Connection Support: Added support for connecting to the MCP server via standard input/output (stdio) method
  • Console Output Capture Tool: New chrome_console tool for capturing browser console output

[v0.0.3]

Added

  • Inject script tool: For injecting content scripts into web page
  • Send command to inject script tool: For sending commands to the injected script

[v0.0.2]

Added

  • Conditional Semantic Engine Initialization: Smart cache-based initialization that only loads models when cached versions are available
  • Enhanced Model Cache Management: Comprehensive cache management system with automatic cleanup and size limits
  • Windows Platform Compatibility: Full support for Windows Chrome Native Messaging with registry-based manifest detection
  • Cache Statistics and Manual Management: User interface for viewing cache stats and manual cache cleanup
  • Concurrent Initialization Protection: Prevents duplicate initialization attempts across components

Improved

  • Startup Performance: Dramatically reduced startup time when no model cache exists (from ~3s to ~0.5s)
  • Memory Usage: Optimized memory consumption through on-demand model loading
  • Cache Expiration Logic: Intelligent cache expiration (14 days) with automatic cleanup
  • Error Handling: Enhanced error handling for model initialization failures
  • Component Coordination: Simplified initialization flow between semantic engine and content indexer

Fixed

  • Windows Native Host Issues: Resolved Node.js environment conflicts with multiple NVM installations
  • Race Condition Prevention: Eliminated concurrent initialization attempts that could cause conflicts
  • Cache Size Management: Automatic cleanup when cache exceeds 500MB limit
  • Model Download Optimization: Prevents unnecessary model downloads during plugin startup

Technical Improvements

  • ModelCacheManager: Added isModelCached() and hasAnyValidCache() methods for cache detection
  • SemanticSimilarityEngine: Added cache checking functions and conditional initialization logic
  • Background Script: Implemented smart initialization based on cache availability
  • VectorSearchTool: Simplified to passive initialization model
  • ContentIndexer: Enhanced with semantic engine readiness checks

Documentation

  • Added comprehensive conditional initialization documentation
  • Updated cache management system documentation
  • Created troubleshooting guides for Windows platform issues

[v0.0.1]

Added

  • Core Browser Tools: Complete set of browser automation tools for web interaction

    • Click Tool: Intelligent element clicking with coordinate and selector support
    • Fill Tool: Form filling with text input and selection capabilities
    • Screenshot Tool: Full page and element-specific screenshot capture
    • Navigation Tools: URL navigation and page interaction utilities
    • Keyboard Tool: Keyboard input simulation and hotkey support
  • Vector Search Engine: Advanced semantic search capabilities

    • Content Indexing: Automatic indexing of browser tab content
    • Semantic Similarity: AI-powered text similarity matching
    • Vector Database: Efficient storage and retrieval of embeddings
    • Multi-language Support: Comprehensive multilingual text processing
  • Native Host Integration: Seamless communication with external applications

    • Chrome Native Messaging: Bidirectional communication channel
    • Cross-platform Support: Windows, macOS, and Linux compatibility
    • Message Protocol: Structured messaging system for tool execution
  • AI Model Integration: State-of-the-art language models for semantic processing

    • Transformer Models: Support for multiple pre-trained models
    • ONNX Runtime: Optimized model inference with WebAssembly
    • Model Management: Dynamic model loading and switching
    • Performance Optimization: SIMD acceleration and memory pooling
  • User Interface: Intuitive popup interface for extension management

    • Model Selection: Easy switching between different AI models
    • Status Monitoring: Real-time initialization and download progress
    • Settings Management: User preferences and configuration options
    • Cache Management: Visual cache statistics and cleanup controls

Technical Foundation

  • Extension Architecture: Robust Chrome extension with background scripts and content injection
  • Worker-based Processing: Offscreen document for heavy computational tasks
  • Memory Management: LRU caching and efficient resource utilization
  • Error Handling: Comprehensive error reporting and recovery mechanisms
  • TypeScript Implementation: Full type safety and modern JavaScript features

Initial Features

  • Multi-tab content analysis and search
  • Real-time semantic similarity computation
  • Automated web page interaction
  • Cross-platform native messaging
  • Extensible tool framework for future enhancements