Google announced Veo AI for video creation and PaliGemma 2 for image and text processing


Google has introduced a new generative AI model called Veo, designed for video content creation. The tool, available through the Vertex AI platform, allows users to generate videos longer than a minute and in 1080p resolution.

 

Veo requires no special skills: just enter a text prompt or upload an image. The AI transforms the input into a video sequence, taking into account the visual style and cinematic effects selected by the user.

 

The resulting videos can be edited: individual elements can be adjusted, and personalized details such as logos can be added. Veo is aimed at creative professionals, marketers, and content creators, helping them quickly create visually appealing materials.

 


 

Google has also introduced PaliGemma 2, the successor to the original PaliGemma model, designed to work with text and images. The announcement follows a demonstration of Gemma 2’s capabilities at the I/O 2024 conference.

 

PaliGemma 2 expands the capabilities of its predecessor, which focused on captioning images and videos, recognizing text, detecting objects, and answering questions about visual content. The new version adds a “long caption” function that generates more detailed descriptions of visual content, including actions, emotions, and the general context of the scene.

 

Improved features:

  • Long description generation: taking into account complex details and scene atmosphere.
  • Analysis of complex structures: improved recognition of tables, chemical formulas, and musical scores.
  • Spatial reasoning: more accurate analysis of X-ray images and other medical data.

 

PaliGemma 2 is available in several versions with different numbers of parameters (3B, 10B, 28B), which allows it to be adapted to different tasks and data volumes. Performance is significantly improved, and the new model is compatible with the previous one, which simplifies its integration.

 

The PaliGemma 2 models and code are already hosted on platforms such as Kaggle, Hugging Face, and Ollama, making them accessible to developers and researchers.
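
For readers who want to try the model, here is a minimal sketch of how one of the Hugging Face checkpoints might be loaded with the `transformers` library. It rests on several assumptions not stated in the article: the checkpoint naming scheme `google/paligemma2-<size>-pt-<resolution>`, the input resolutions 224, 448, and 896, and the `<image>caption en` prompt are taken from the public model listings, and the helper names `checkpoint_id` and `caption_image` are hypothetical. The checkpoints are gated, so the license has to be accepted on Hugging Face before downloading.

```python
# Parameter sizes named in the article; the resolutions are an assumption
# based on the checkpoint names listed on Hugging Face.
SIZES = ("3b", "10b", "28b")
RESOLUTIONS = (224, 448, 896)


def checkpoint_id(size: str = "3b", resolution: int = 224) -> str:
    """Build a model id such as 'google/paligemma2-3b-pt-224' (assumed naming)."""
    if size not in SIZES:
        raise ValueError(f"size must be one of {SIZES}, got {size!r}")
    if resolution not in RESOLUTIONS:
        raise ValueError(f"resolution must be one of {RESOLUTIONS}, got {resolution!r}")
    return f"google/paligemma2-{size}-pt-{resolution}"


def caption_image(image_path: str, model_id: str,
                  prompt: str = "<image>caption en") -> str:
    """Caption one image; needs transformers, torch, and pillow installed."""
    # Imported lazily so checkpoint_id() works without the heavy dependencies.
    import torch
    from PIL import Image
    from transformers import AutoProcessor, PaliGemmaForConditionalGeneration

    processor = AutoProcessor.from_pretrained(model_id)
    model = PaliGemmaForConditionalGeneration.from_pretrained(model_id)

    image = Image.open(image_path).convert("RGB")
    inputs = processor(text=prompt, images=image, return_tensors="pt")
    prompt_len = inputs["input_ids"].shape[-1]

    with torch.inference_mode():
        output = model.generate(**inputs, max_new_tokens=64, do_sample=False)

    # Drop the prompt tokens and decode only the newly generated caption.
    return processor.decode(output[0][prompt_len:], skip_special_tokens=True)
```

A call such as `caption_image("photo.jpg", checkpoint_id("3b", 224))` would download the smallest variant on first use; the larger variants need considerably more memory. Pretrained (`pt`) checkpoints are intended mainly as a starting point for fine-tuning, so caption quality may vary.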

