Virtual Desktop Browser
v1.0.0
Description
name: virtual-desktop-browser
description: Launch Chromium in non-headless mode inside Xvfb virtual display (fixed 1200x720x24) and automate with human-like mouse/keyboard/screenshot operations. Use for bot-resistant sites like Xiaohongshu and X/Twitter where GUI simulation is required.
Virtual Desktop Browser Skill
This skill provides a deterministic GUI automation runtime:
- Xvfb virtual display (1200x720x24)
- Chromium non-headless browser
- PyAutoGUI control (click/type/hotkey/scroll)
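As a rough illustration of how these three pieces fit together, the sketch below builds the command lines for Xvfb and Chromium and starts both with the same DISPLAY. It is not the skill's actual implementation: the helper names (`xvfb_command`, `chromium_command`, `launch`), the default display `:99`, and the choice of Chromium flags are assumptions made for this example. Only the 1200x720x24 geometry and the `chromium-browser` package name come from this document.

```python
import os
import subprocess
from typing import List, Optional

SCREEN_GEOMETRY = "1200x720x24"  # fixed geometry stated by the skill


def xvfb_command(display: str) -> List[str]:
    """Build the Xvfb argv for one screen with the fixed geometry."""
    return ["Xvfb", display, "-screen", "0", SCREEN_GEOMETRY]


def chromium_command(url: Optional[str] = None,
                     binary: str = "chromium-browser") -> List[str]:
    """Build a Chromium argv sized to fill the virtual screen (assumed flags)."""
    width, height, _depth = SCREEN_GEOMETRY.split("x")
    cmd = [binary, f"--window-size={width},{height}", "--window-position=0,0"]
    if url:
        cmd.append(url)
    return cmd


def launch(display: str = ":99", url: Optional[str] = None):
    """Start Xvfb, then Chromium pointed at that display (hypothetical helper)."""
    xvfb = subprocess.Popen(xvfb_command(display))
    # Chromium must see the virtual display through the DISPLAY variable.
    env = dict(os.environ, DISPLAY=display)
    chromium = subprocess.Popen(chromium_command(url), env=env)
    return xvfb, chromium
```

A real launcher would also need to wait until Xvfb accepts connections before starting Chromium; that step is left out here.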
Required system packages
apt-get update
apt-get install -y xvfb chromium-browser \
libnss3 libgconf-2-4 libxss1 libasound2 \
libatk1.0-0 libatk-bridge2.0-0 libcups2 \
libdrm2 libgbm1 libgtk-3-0 libxshmfence1 x11-utils
Python dependencies
pip install -r requirements.txt
Tool surface
All tools return JSON dicts.
- browser_start(url=None, display=None): start Xvfb + Chromium
- browser_stop(): close processes
- browser_snapshot(region=None): screenshot as Base64 PNG
- browser_click(x, y, button='left', clicks=1, duration=0.5): move + click
- browser_type(text, interval=0.05, wpm=None): type at focus
- browser_hotkey(keys, interval=0.05): key combos
- browser_scroll(clicks=1, direction='vertical', x=None, y=None): mouse wheel
- browser_find_image(image_path, confidence=0.8): template matching (OpenCV)
- browser_get_pixel_color(x, y): read RGB
- browser_activate_window(title_substring): focus window by title
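Since browser_snapshot returns the screenshot as a Base64 PNG, a caller will usually want to decode it and confirm it matches the expected screen size. The sketch below shows one way to do that with the standard library only. The function name `decode_png_snapshot` is hypothetical, and it takes the Base64 string directly because the key under which the tool's JSON dict stores the image is not stated in this document.

```python
import base64
import struct

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def decode_png_snapshot(b64_png: str):
    """Decode a Base64 PNG and return (raw_bytes, width, height)."""
    data = base64.b64decode(b64_png)
    if data[:8] != PNG_SIGNATURE:
        raise ValueError("payload is not a PNG")
    # The first chunk of a PNG is IHDR: 4-byte length, 4-byte type,
    # then big-endian 32-bit width and height.
    if data[12:16] != b"IHDR":
        raise ValueError("missing IHDR chunk")
    width, height = struct.unpack(">II", data[16:24])
    return data, width, height
```

For a full-screen snapshot on the fixed display, the decoded size would be expected to be 1200 by 720; a region snapshot would be smaller.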
Notes
- Browser lifecycle is manual: start once, multi-step flow, then stop.
- Display auto-assigns (:99..:199) if not provided.
- Failsafe: moving mouse to bottom-right corner triggers pyautogui abort.
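How the skill picks a display inside :99..:199 is not described here. One common approach, sketched below as an assumption, is to look for the lock file that an X server creates at /tmp/.X<n>-lock and take the first number without one. The function name `first_free_display` and its parameters are hypothetical.

```python
import os
from typing import Optional


def first_free_display(start: int = 99, end: int = 199,
                       lock_dir: str = "/tmp") -> Optional[str]:
    """Return the first display in [start, end] with no X lock file, else None."""
    for number in range(start, end + 1):
        lock_path = os.path.join(lock_dir, f".X{number}-lock")
        if not os.path.exists(lock_path):
            return f":{number}"
    return None  # every display in the range appears to be in use
```

A lock file can be left behind by a crashed server, so a more careful version might also check whether the process recorded in the file is still alive.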
Language
See docs/ for translations.