OpenAI has developed an AI assistant for computer control and browser automation
30.01.25
OpenAI has introduced a new AI agent called Operator, designed to automate actions in the browser. The tool can interact with interface elements such as buttons, text fields, and scrolling, imitating user actions.
The basis of the work of Operator was the Computer-Using Agent (CUA) model, which combines the image recognition capabilities of GPT-4 with an advanced analysis and decision-making mechanism. The algorithm works in stages: first, a screenshot is created, then the system analyzes it, determines the necessary actions and simulates them using virtual mice and keyboards. Users can observe the process through a small window in the browser.
At the moment, Operator shows the best results in performing routine and repetitive tasks, such as making shopping lists or playlists. However, the agent faces difficulties when working with unfamiliar interfaces, for example, tables, calendars, or when editing complex texts.
Although the technology is in its early stages of development, it promises to be a powerful tool for automating routine processes and working with the browser.
Don't miss interesting news
Subscribe to our channels and read announcements of high-tech news, tes

Review of Samsung Galaxy A36 and Galaxy A56 smartphones: in a shadow of light



The Samsung Galaxy A36 and Galaxy A56 have equally good displays, large batteries, and support for software updates for 6 years. Let’s talk in more detail about what else makes them interesting.

New GPMI Universal Media Interface has bandwidth up to 192 Gbps and power of 480 W development
Major Chinese companies have introduced a new connection standard, General Purpose Media Interface (GPMI), which is physically compatible with USB Type-C and USB Type-B connectors
Microsoft turns 50 years history Microsoft
The history of one of the largest software manufacturers began on April 4, 1975 in Albuquerque, New Mexico.