Available Tools
Eko provides built-in tools for several environments:
Browser extension
Import tools and initialize context:
WebSearch
web_search
: Use web search to return search results
ElementClick
element_click
: Click the element through task prompts
FindElementPosition
find_element_position
: Locate element coordinates through task prompts
OpenUrl
open_url
: Open the specified URL link in browser window
Screenshot
screenshot
: Screenshot the current webpage window
TabManagement
tab_management
: Browser tab management, view and operate tabs.
The command to perform. The available commands are:
tab_all
: View all tabs and return the tabId and title.
current_tab
: Get current tab information (tabId, url, title).
go_back
: Go back to the previous page in the current tab.
change_url [url]
: Open the URL in the current tab, e.g. change_url https://www.google.com
close_tab
: Close the current tab.
switch_tab [tabId]
: Switch to the specified tab using tabId, e.g. switch_tab 1000
new_tab [url]
: Open a new tab window and open the URL, e.g. new_tab https://www.google.com
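The command strings above follow a simple "name, then optional argument" shape. As a minimal sketch of that shape (not Eko's own implementation), the hypothetical helper `parseTabCommand` below splits such a string and checks it against the listed commands. It assumes that the bracketed arguments `[url]` and `[tabId]` are required; the source does not say whether they are optional.

```typescript
// Hypothetical helper, not part of Eko: splits a tab_management command
// string such as "switch_tab 1000" into its name and optional argument.
type TabCommandName =
  | "tab_all"
  | "current_tab"
  | "go_back"
  | "change_url"
  | "close_tab"
  | "switch_tab"
  | "new_tab";

interface TabCommand {
  name: TabCommandName;
  arg?: string;
}

const ALL_COMMANDS: ReadonlySet<string> = new Set([
  "tab_all", "current_tab", "go_back", "change_url",
  "close_tab", "switch_tab", "new_tab",
]);

// Assumption: commands documented with a bracketed argument require it.
const COMMANDS_WITH_ARG: ReadonlySet<string> = new Set([
  "change_url", "switch_tab", "new_tab",
]);

function parseTabCommand(input: string): TabCommand {
  const trimmed = input.trim();
  const space = trimmed.search(/\s/);
  const name = space === -1 ? trimmed : trimmed.slice(0, space);
  const arg = space === -1 ? "" : trimmed.slice(space).trim();

  if (!ALL_COMMANDS.has(name)) {
    throw new Error(`Unknown command: ${name}`);
  }
  if (COMMANDS_WITH_ARG.has(name) && arg === "") {
    throw new Error(`${name} requires an argument`);
  }
  if (!COMMANDS_WITH_ARG.has(name) && arg !== "") {
    throw new Error(`${name} takes no argument`);
  }
  return arg === ""
    ? { name: name as TabCommandName }
    : { name: name as TabCommandName, arg };
}
```

For example, `parseTabCommand("switch_tab 1000")` yields the name `switch_tab` with the argument `"1000"`, while `parseTabCommand("change_url")` throws because the URL is missing.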
ExtractContent
extract_content
: Extract the text content of the current webpage
ExportFile
export_file
: Export content as a file; text format is supported
Browser Use
browser_use
: Use structured commands to interact with the browser, manipulating page elements through screenshots and webpage element extraction.
The action to perform. The available actions are:
screenshot_extract_element
: Take a screenshot of the web page and extract operable elements.
- Screenshots are used to understand page layouts, with labeled bounding boxes corresponding to element indexes. Each bounding box and its label share the same color, with labels typically positioned in the top-right corner of the box.
- Screenshots help verify element positions and relationships. Labels may sometimes overlap, so extracted elements are used to verify the correct elements.
- In addition to screenshots, simplified information about interactive elements is returned, with element indexes corresponding to those in the screenshots.
input_text
: Enter a string in the interactive element.
clear_text
: Clear the text in the input/textarea element.
click
: Click the element.
right_click
: Right-click the element.
double_click
: Double-click the element.
scroll_to
: Scroll to the specified element.
extract_content
: Extract the text content of the current webpage.
get_dropdown_options
: Get all options from a native dropdown element.
select_dropdown_option
: Select a dropdown option for the interactive element index, identified by the text of the option you want to select.
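Because element-level actions refer to elements by the indexes returned from screenshot_extract_element, it can help to check an index against the latest extraction before acting on it. The following is a minimal sketch of that check, not Eko's actual types or API: the field names `index`, `text`, `tag`, and `label`, the `BrowserAction` union, and the `findTarget` helper are all assumptions made for illustration.

```typescript
// Hypothetical shapes for illustration only; the real browser_use
// parameters may be named differently.
type BrowserAction =
  | { action: "screenshot_extract_element" }
  | { action: "extract_content" }
  | { action: "input_text"; index: number; text: string }
  | { action: "select_dropdown_option"; index: number; text: string }
  | {
      action:
        | "clear_text"
        | "click"
        | "right_click"
        | "double_click"
        | "scroll_to"
        | "get_dropdown_options";
      index: number;
    };

// Simplified stand-in for one entry of the extracted element list.
interface ExtractedElement {
  index: number;
  tag: string;
  label: string;
}

// Returns the element an action targets, or undefined for page-level
// actions that carry no index. Throws if the index was not extracted.
function findTarget(
  action: BrowserAction,
  elements: ExtractedElement[],
): ExtractedElement | undefined {
  if (!("index" in action)) {
    return undefined;
  }
  const target = elements.find((el) => el.index === action.index);
  if (target === undefined) {
    throw new Error(`No extracted element with index ${action.index}`);
  }
  return target;
}
```

A caller would pass the element list from the most recent extraction, so that a stale or mistyped index fails early instead of acting on the wrong element.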
Web
Import tools and initialize context:
ElementClick
element_click
: Click the element through task prompts
FindElementPosition
find_element_position
: Locate element coordinates through task prompts
Screenshot
screenshot
: Screenshot the current webpage window
ExtractContent
extract_content
: Extract the text content of the current webpage
ExportFile
export_file
: Export content as a file; text format is supported
Browser Use
browser_use
: Use structured commands to interact with the browser, manipulating page elements through screenshots and webpage element extraction.
The action to perform. The available actions are:
screenshot_extract_element
: Take a screenshot of the web page and extract operable elements.
- Screenshots are used to understand page layouts, with labeled bounding boxes corresponding to element indexes. Each bounding box and its label share the same color, with labels typically positioned in the top-right corner of the box.
- Screenshots help verify element positions and relationships. Labels may sometimes overlap, so extracted elements are used to verify the correct elements.
- In addition to screenshots, simplified information about interactive elements is returned, with element indexes corresponding to those in the screenshots.
input_text
: Enter a string in the interactive element.
clear_text
: Clear the text in the input/textarea element.
click
: Click the element.
right_click
: Right-click the element.
double_click
: Double-click the element.
scroll_to
: Scroll to the specified element.
extract_content
: Extract the text content of the current webpage.
get_dropdown_options
: Get all options from a native dropdown element.
select_dropdown_option
: Select a dropdown option for the interactive element index, identified by the text of the option you want to select.
Node.js
Import tools:
CommandExecute
command_execute
: Execute a shell command with user confirmation
FileRead
file_read
: Read content from a file
FileWrite
file_write
: Write content to a file with user confirmation
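Two of the Node.js tools run only after user confirmation. The source does not describe how the confirmation is obtained, so the following is only a sketch of the general pattern under stated assumptions: `runWithConfirmation` and the `Confirm` callback are hypothetical names, and the callback is synchronous to keep the example short.

```typescript
// Hypothetical confirmation gate, not Eko's implementation: the action
// runs only if the confirm callback approves it.
type Confirm = (message: string) => boolean;

type GateResult<T> = { ran: true; result: T } | { ran: false };

function runWithConfirmation<T>(
  description: string,
  confirm: Confirm,
  action: () => T,
): GateResult<T> {
  if (!confirm(`Allow: ${description}?`)) {
    // The user declined, so the action is never invoked.
    return { ran: false };
  }
  return { ran: true, result: action() };
}
```

In a real tool the callback would prompt the user, and the action would be the shell command or file write that needs approval.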
Fellou browser
The tools in this section can be used in both the extension and web environments of the Fellou browser.
Computer Use
computer_use
: Use a mouse and keyboard to interact with a computer, and take screenshots.
The action to perform. The available actions are:
key
: Press a key or key-combination on the keyboard.
- This supports robotgo hotkey syntax.
- Multiple keys are combined using the "+" symbol.
- Examples: "a", "enter", "ctrl+s", "command+shift+a", "num0".
type
: Type a string of text on the keyboard.
cursor_position
: Get the current (x, y) pixel coordinate of the cursor on the screen.
mouse_move
: Move the cursor to a specified (x, y) pixel coordinate on the screen.
left_click
: Click the left mouse button.
left_click_drag
: Click and drag the cursor to a specified (x, y) pixel coordinate on the screen.
right_click
: Click the right mouse button.
double_click
: Double-click the left mouse button.
screenshot
: Take a screenshot of the screen.
scroll
: Scroll to the specified (x, y) pixel coordinates on the screen.
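The key action above combines keys with the "+" symbol. As a rough sketch of how such a string breaks down, the hypothetical helper `parseKeyCombo` below splits a combination into modifiers and a final key. It assumes that every part before the last one is a modifier, and it does not check names against the robotgo key list.

```typescript
// Hypothetical helper, not part of Eko or robotgo: splits a key string
// such as "command+shift+a" into leading modifiers and the final key.
interface KeyCombo {
  modifiers: string[];
  key: string;
}

function parseKeyCombo(combo: string): KeyCombo {
  const parts = combo.split("+").map((p) => p.trim().toLowerCase());
  // An empty part means a stray or doubled "+", e.g. "ctrl+" or "a++b".
  if (parts.some((p) => p === "")) {
    throw new Error(`Invalid key combination: "${combo}"`);
  }
  // Assumption: the last part is the key, everything before it a modifier.
  const key = parts[parts.length - 1] as string;
  return { modifiers: parts.slice(0, -1), key };
}
```

With this split, a single key such as "enter" has no modifiers, while "command+shift+a" has the modifiers `command` and `shift` and the key `a`.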
Example: Search Elon Musk on Google
NOTICE: This example can only run in the browser extension environment.