LaVague is an open-source Large Action Model framework designed to help developers build powerful AI Web Agents that automate complex web tasks by understanding natural language objectives. It combines a World Model, which interprets your goals and current web context, with an Action Engine that translates instructions into executable code using web automation tools like Selenium or Playwright. LaVague enables anyone to create agents capable of navigating, interacting, and automating websites with minimal coding, making web automation more accessible and flexible.
Key Features:
Automatically generate and execute multi-step web actions from simple natural language commands.
Supports multiple web drivers including Selenium, Playwright, and a Chrome extension for versatile automation.
Built-in tools for performance testing, token counting, logging, debugging, and interactive demos.
Highly customizable with user-configurable settings and context handling to adapt to varied web environments.
Use Cases:
Automate web QA and testing, turning Gherkin specs into executable test scripts.
Streamline repetitive web-based workflows like data entry, form submission, and information extraction.
Create intelligent agents for SaaS platform automation, improving efficiency in business processes.
Technical Specifications:
Developed primarily in Python, compatible with OpenAI, Llama 3, Azure OpenAI, Gemini, and other AI models.
Runs locally or in the cloud with full support for Selenium WebDriver browsers and other automation drivers.
Open-source under Apache-2.0 license, backed by an active community and comprehensive documentation.