StreamingMemeStreamingMeme
LeaderboardsEventsSubmit News
SUBSCRIBE

Daily Brief

The streaming industry in your inbox every morning.

Daily Brief

The streaming industry in your inbox every morning.

StreamingMeme

The streaming technology industry news aggregator.

About UsNewsletterSubmit News
© 2026 StreamingMeme. All rights reserved.
← AI for Video
AI & VideoProduct LaunchMarch 10, 2026

Google adds multimodal embeddings across text, video, audio

Google adds multimodal embeddings across text, video, audio
Google

Google has launched Gemini Embedding 2, its first natively multimodal embedding model, now available in Public Preview via the Gemini API and Vertex AI. This model can map text, images, video, audio, and documents (up to 6 pages) into a single embedding space, facilitating multimodal retrieval and classification and enhancing AI applications like Retrieval-Augmented Generation (RAG) and semantic search.

Key Takeaways

  • Gemini Embedding 2 is Google’s first natively multimodal embedding model, available now in Public Preview.
  • The model accepts text, up to 6 images, up to 120 seconds of video, audio, and PDFs up to 6 pages.
  • Google says the model supports semantic intent across more than 100 languages and interleaved inputs like image + text in one request.
  • The embedding output uses Matryoshka Representation Learning, with dimensions that can scale down from 3072 to 1536 or 768.
  • Early partners cited results such as Paramount Skydance’s 85.3% text-to-video Recall@1 and Mindlid’s 20% lift in top-1 recall.

Why It Matters

Gemini Embedding 2 gives streaming and media teams a single embedding layer for text, images, video, audio, and short documents instead of separate pipelines for each format. Google positions it for multimodal retrieval, RAG, semantic search, sentiment analysis, and clustering, and says it already supports tools like LangChain, LlamaIndex, Haystack, Weaviate, QDrant, ChromaDB, and Vertex AI Vector Search. The clearest near-term signal to watch is whether developers adopt the preview through Gemini API and Vertex AI, and whether the benchmark claims translate into production search and retrieval gains like Paramount Skydance’s 85.3% Recall@1.


Read full article at blog.google

Related Articles

Agora: Agora Integrates OpenAI Real-Time API for Low-Latency Conversational AI
Amazon Web Services, Inc.: AWS SageMaker Adds Multi-Turn RL for Specialized AI Model Training
wTVision: wTVision Debuts CricketStats CG, Enters Cricket Graphics Market in Bangladesh

Newest

about 14 hours ago
Pro AVL Central: Blackmagic Debuts Fairlight Live, Boosts DaVinci Resolve 21 with AI and Photo Tools
about 14 hours ago
NewscastStudio: MXL Rapid Development Challenges Traditional Broadcast Standardization
about 14 hours ago
Smpte: SMPTE Media Technology Summit Returns to Pasadena November 2026
about 14 hours ago
Tech Times: Let's Encrypt charts Merkle Tree Certificate path for post-quantum TLS
about 14 hours ago
cvefeed.io: Netty Fixes Undetected Stream Truncation in Chunked OHTTP Messages
about 14 hours ago
Ietf: IETF Advances Network Protocol Drafts for Streaming Infrastructure
about 14 hours ago
Forasoft: Fora Soft Launches Monthly WebRTC & Real-time Video Engineering Report
about 14 hours ago
Atis: ATIS Outlines Practical Roadmap for North American 5G Standalone Deployment
about 14 hours ago
Youtube: 3GPP Advances 5G-Advanced with Release 19, Commences 6G Studies
about 14 hours ago
3gpp: 3GPP Release 6 Refines Radio Network Rules for Cell Handover, Measurement
about 14 hours ago
3gpp: 3GPP Details 20 Mobile Telecommunications Releases, Including Open Release 21
about 14 hours ago
Pro AVL Central: Matrox Launches IPMX-Ready Maevex MGX Series for 4K60 AV-over-IP
about 14 hours ago
GitHub: OpenMOSS Expands MOSS-TTS Family with Nano Model, Enhanced SoundEffects
about 14 hours ago
NewscastStudio: Media Exchange Layer (MXL) Complements ST 2110 for Software-Defined Production
about 14 hours ago
Penligent Security Blog – AI-Driven Hacking Tutorials, Exploit PoCs & Cybersecurity Research: HTTP/2 Bomb Vulnerability: Apache, Envoy, Nginx Face DoS Risk
about 14 hours ago
SamsungNewsroom: Samsung Galaxy S26 Series Introduces Cine LUT for Accessible Mobile Color Grading
about 15 hours ago
KORE1: Spotify Engineers: A Six-Profile Map for Strategic Hiring
about 15 hours ago
TV Tech: GatesAir Establishes Brazil Hub for DTV+ Rollout, Local Support
about 15 hours ago
Telecompaper: Technicolor Joins Pearl TV Initiative for Affordable ATSC 3.0 Converter Boxes
about 15 hours ago
law360: Generative AI, SEPs Drive IP Licensing Activity from May 22-June 4

Upcoming Events

Jun
8–11
NEM Dubrovnikhttps://neweumarket.com/dubrovnik/
Jun
11–12
Arctic 15https://arctic15.com/
Jun
13–19
InfoCommhttps://www.infocommshow.org/
Jun
16–19
Stream TV Show (formerly the Pay TV Show)https://www.streamtvshow.com/
Jun
17–19
Content Tokyo 2024https://www.content-tokyo.jp/ja-jp.html
View all events →

Top Sources

  1. 1.wTVision163
  2. 2.MSN152
  3. 3.Calendly86
  4. 4.Advanced Television63
  5. 5.Sports Video Group62
  6. 6.TV Technology40
  7. 7.Cord Cutters News40
  8. 8.Broadband TV News35
Full leaderboards →

Newest

about 14 hours ago
Pro AVL Central: Blackmagic Debuts Fairlight Live, Boosts DaVinci Resolve 21 with AI and Photo Tools
about 14 hours ago
NewscastStudio: MXL Rapid Development Challenges Traditional Broadcast Standardization
about 14 hours ago
Smpte: SMPTE Media Technology Summit Returns to Pasadena November 2026
about 14 hours ago
Tech Times: Let's Encrypt charts Merkle Tree Certificate path for post-quantum TLS
about 14 hours ago
cvefeed.io: Netty Fixes Undetected Stream Truncation in Chunked OHTTP Messages
about 14 hours ago
Ietf: IETF Advances Network Protocol Drafts for Streaming Infrastructure
about 14 hours ago
Forasoft: Fora Soft Launches Monthly WebRTC & Real-time Video Engineering Report
about 14 hours ago
Atis: ATIS Outlines Practical Roadmap for North American 5G Standalone Deployment
about 14 hours ago
Youtube: 3GPP Advances 5G-Advanced with Release 19, Commences 6G Studies
about 14 hours ago
3gpp: 3GPP Release 6 Refines Radio Network Rules for Cell Handover, Measurement
about 14 hours ago
3gpp: 3GPP Details 20 Mobile Telecommunications Releases, Including Open Release 21
about 14 hours ago
Pro AVL Central: Matrox Launches IPMX-Ready Maevex MGX Series for 4K60 AV-over-IP
about 14 hours ago
GitHub: OpenMOSS Expands MOSS-TTS Family with Nano Model, Enhanced SoundEffects
about 14 hours ago
NewscastStudio: Media Exchange Layer (MXL) Complements ST 2110 for Software-Defined Production
about 14 hours ago
Penligent Security Blog – AI-Driven Hacking Tutorials, Exploit PoCs & Cybersecurity Research: HTTP/2 Bomb Vulnerability: Apache, Envoy, Nginx Face DoS Risk
about 14 hours ago
SamsungNewsroom: Samsung Galaxy S26 Series Introduces Cine LUT for Accessible Mobile Color Grading
about 15 hours ago
KORE1: Spotify Engineers: A Six-Profile Map for Strategic Hiring
about 15 hours ago
TV Tech: GatesAir Establishes Brazil Hub for DTV+ Rollout, Local Support
about 15 hours ago
Telecompaper: Technicolor Joins Pearl TV Initiative for Affordable ATSC 3.0 Converter Boxes
about 15 hours ago
law360: Generative AI, SEPs Drive IP Licensing Activity from May 22-June 4

Upcoming Events

Jun
8–11
NEM Dubrovnikhttps://neweumarket.com/dubrovnik/
Jun
11–12
Arctic 15https://arctic15.com/
Jun
13–19
InfoCommhttps://www.infocommshow.org/
Jun
16–19
Stream TV Show (formerly the Pay TV Show)https://www.streamtvshow.com/
Jun
17–19
Content Tokyo 2024https://www.content-tokyo.jp/ja-jp.html
View all events →

Top Sources

  1. 1.wTVision163
  2. 2.MSN152
  3. 3.Calendly86
  4. 4.Advanced Television63
  5. 5.Sports Video Group62
  6. 6.TV Technology40
  7. 7.Cord Cutters News40
  8. 8.Broadband TV News35
Full leaderboards →