Type something to search...

Let an AI Agent Resize a Window and Take the Screenshot

Screen automation is easiest to trust when the result is visible and repeatable. In this short demo, an AI agent uses the OverRec screen skill to find a browser window, resize it to an exact rectangle, and capture a clean screenshot without manual dragging.


What the Demo Shows

The target is a Chrome window open to the Hermes Agent website. The agent is running in a terminal and has access to the overrec-screen skill, which wraps OverRec’s window tools.

The workflow is simple:

  1. Confirm OverRec is available
  2. Find the target window
  3. Move and resize it to an exact position
  4. Take a screenshot of that exact region
  5. Copy the screenshot to the requested output path

No manual window positioning is involved.


Step 1: Ask the Agent for the Screen Task

The agent is given a task like:

Use overrec-screen skill to resize the Hermes Agent window to 800x600
and take a screenshot.

The important part is that the instruction describes the desired result, not every low-level command. The skill gives the agent a reliable way to translate that request into window operations.


Step 2: Find the Window

The agent first searches for a matching window title:

OverRec.exe cli window hermes

When the exact title is not found, it broadens the search by listing visible windows:

OverRec.exe cli window --all hermes
OverRec.exe cli window edge
OverRec.exe cli window

In the demo, the Hermes Agent page is open in Google Chrome, and OverRec returns its window ID. Once the agent has that ID, it can control the window directly.


Step 3: Snap the Browser to an Exact Rectangle

The agent snaps the Chrome window to position 150,150 with size 800x600:

OverRec.exe cli snap --windowid 264604 --location 150,150 --size 800x600

OverRec restores the window if needed, moves it, resizes it, and brings it forward. The result is a predictable browser frame in the same place every time.

That repeatability matters for:

  • documentation screenshots
  • UI comparison images
  • agent workflows that need visual confirmation
  • bug reports where the capture should show the same region every run

Step 4: Capture the Screenshot

After the window is positioned, the agent uses OverRec to capture the same rectangle:

OverRec.exe cli screenshot --location 150,150 --size 800x600 --output "C:\Users\HP\storage\screenshots\overrec\hermes_agent.png" --no-clipboard

The screenshot is written directly to disk, so the workflow does not depend on the clipboard or a manual save step.


Why This Is Useful for AI Workflows

AI agents can reason about tasks, but desktop screenshots are only useful when the screen state is controlled. If a browser is half off-screen, hidden behind another app, or slightly different in size every time, screenshots become noisy.

OverRec gives the agent deterministic screen control:

  • find a window by title
  • place it at exact coordinates
  • resize it to exact dimensions
  • capture the exact region needed

That turns “take a screenshot of this app” from a fragile visual task into a repeatable automation step.


The Core Pattern

For any window, the pattern is:

OverRec.exe cli window <search text>
OverRec.exe cli snap --windowid <ID> --location <X,Y> --size <WIDTH>x<HEIGHT>
OverRec.exe cli screenshot --location <X,Y> --size <WIDTH>x<HEIGHT> --output "<file>"

Use the GUI when you want to draw the rectangle by hand. Use the CLI when you want the same result from a script, terminal, or AI agent.

Download OverRec from the Microsoft Store.

Related Posts

Learn Vocabulary Effortlessly with Infini Alchemy

Learn Vocabulary Effortlessly with Infini Alchemy

Are you looking for a fun and engaging game that can also help you learn new words? Look no further! Infini Alchemy is a creative alchemy game where you can combine basic elements like Water, Fire, Ea

read more
How to copy the formulas in ChatGPT response to word

How to copy the formulas in ChatGPT response to word

Copy the formulas in ChatGPT response to word, simply 3 steps:Copy the ChatGPT response as markdown (keep LaTeX formulas) Select the formulas in Word and insert as equation Convert LaTeX fo

read more
Master KET Vocabulary Through Alchemy: A New Learning Adventure

Master KET Vocabulary Through Alchemy: A New Learning Adventure

Infini Alchemy is an innovative web-based game that transforms vocabulary learning into an engaging alchemical adventure. By dragging and dropping elements to craft new items, players naturally absorb

read more
Real-Time Video Effects with GPU Acceleration

Real-Time Video Effects with GPU Acceleration

GPU rendering is now available in Sub Dimension Camera. This feature eliminates the need to wait for a video to "render" after recording. You can now preview and capture complex artistic filters in re

read more
See the World Differently: Camera Effects in Sub Dimension

See the World Differently: Camera Effects in Sub Dimension

Your camera captures moments. Effects transform them into art. Sub Dimension comes packed with real-time visual effects that apply live in the viewfinder — what you see is what you get, whether you'r

read more

Move Any Window to an Exact Position and Size

You've set up a perfect workflow: your code editor on the left, browser on the right, terminal at the bottom. Then you restart your machine and spend the next five minutes dragging windows back into p

read more

将任意窗口精确定位到指定位置和尺寸

你精心搭建了完美的工作流:左边代码编辑器,右边浏览器,底部终端。然后重启电脑,又花了五分钟把窗口一个个拖回原位——差不多,但就是不够准。 OverRec 解决了这个问题。手动拖拽窗口的局限 用鼠标拖拽窗口本来就不精确。你瞄准第 960 像素,落点却是 963。你把窗口调到"看起来差不多",却在一秒后发现那道缝隙。日常使用无所谓,但对于录屏、文档截图或需要重复使用的工作区布局

read more

Mueve Cualquier Ventana a una Posición y Tamaño Exactos

Has configurado el flujo de trabajo perfecto: tu editor de código a la izquierda, el navegador a la derecha, la terminal abajo. Luego reinicias el equipo y pasas los siguientes cinco minutos arrastran

read more

Deja que un Agente de IA Redimensione una Ventana y Tome la Captura

La automatización de pantalla es más fácil de confiar cuando el resultado es visible y repetible. En esta breve demo, un agente de IA usa la habilidad de pantalla de OverRec para encontrar una ventana

read more

让 AI 智能体调整窗口大小并完成截图

屏幕自动化要让人放心,结果就必须可见、可重复。在这个简短演示中,AI 智能体使用 OverRec 屏幕技能找到一个浏览器窗口,将其调整到精确矩形,并在无需手动拖拽的情况下截取干净的截图。

read more