返回 Skill 列表
extension
分类: 开发与工程无需 API Key

Windows UI Automation

使用PowerShell自动执行Windows GUI交互(鼠标、键盘、窗口)。适用于需要在桌面上模拟用户输入的场景,如移动光标、点击按钮、在非Web应用中输入文本或管理窗口状态。

person作者: wwb-danielhubclawhub

Windows UI Automation

Control the Windows desktop environment programmatically.

Core Capabilities

  • Mouse: Move, click (left/right/double), drag.
  • Keyboard: Send text, press special keys (Enter, Tab, Alt, etc.).
  • Windows: Find, focus, minimize/maximize, and screenshot windows.

Usage Guide

Mouse Control

Use the provided PowerShell script mouse_control.ps1.txt:

# Move to X, Y
powershell -File skills/windows-ui-automation/mouse_control.ps1.txt -Action move -X 500 -Y 500

# Click at current position
powershell -File skills/windows-ui-automation/mouse_control.ps1.txt -Action click

# Right click
powershell -File skills/windows-ui-automation/mouse_control.ps1.txt -Action rightclick

Keyboard Control

Use keyboard_control.ps1.txt:

# Type text
powershell -File skills/windows-ui-automation/keyboard_control.ps1.txt -Text "Hello World"

# Press Enter
powershell -File skills/windows-ui-automation/keyboard_control.ps1.txt -Key "{ENTER}"

Window Management

To focus a window by title:

$wshell = New-Object -ComObject WScript.Shell; $wshell.AppActivate("Notepad")

Best Practices

  1. Safety: Always move the mouse slowly or include delays between actions.
  2. Verification: Take a screenshot before and after complex UI actions to verify state.
  3. Coordinates: Remember that coordinates (0,0) are at the top-left of the primary monitor.