Google boosts Gemini, Flow AI; Intrinsic robotics folds into Google
TL;DR
- 1Gemini sur Android automatise les tâches multi-étapes ; Circle to Search identifie plusieurs éléments dans une image.
- 2Le studio créatif IA de Google, Flow, est relancé en tant que plateforme tout-en-un avec des fonctions génératives gratuites.
- 3L'entreprise de robotique Intrinsic s'intègre à Google, utilisant Gemini et DeepMind pour faire avancer l'IA physique.
Google has announced a significant expansion of its artificial intelligence capabilities, deepening AI integration across its consumer products Gemini and Flow, while strategically consolidating its robotics efforts with Intrinsic. This multi-pronged approach signals Google's intent to deliver more pervasive and powerful AI experiences, from automated mobile tasks to advanced creative work and real-world robotics.
For users of its mobile AI, Gemini on Android is now equipped to automate multi-step tasks, significantly enhancing its utility beyond conversational interactions. This update allows Gemini to manage complex requests such as coordinating rideshare services or handling grocery and food delivery orders directly from the device, streamlining daily chores for users and pushing Gemini further into the realm of a proactive personal assistant (TechCrunch AI). Complementing this, the popular Circle to Search feature has been upgraded, enabling it to explore and identify multiple items within a single image. This enhancement dramatically improves the accuracy and comprehensiveness of visual searches, offering a more intuitive way for users to gather information or shop for multiple products simultaneously (Google AI Blog).
In the creative AI sphere, Google has relaunched its AI creative studio, Flow, transforming it into an all-in-one platform for image and video generation and editing. Flow now boasts free image generators and a suite of new editing features, positioning it as a robust competitor in the burgeoning market for generative AI tools. This move aims to democratize access to advanced creative AI, allowing a broader user base to leverage sophisticated content creation capabilities previously reserved for more specialized or costly applications (The Decoder).
Perhaps the most strategic consolidation comes with Intrinsic, Alphabet's robotics software company, formally moving under Google's domain. Nearly five years after its spin-out, Intrinsic will now integrate directly with Google's formidable AI ecosystem, leveraging Gemini models and Google Cloud infrastructure, and collaborating closely with Google DeepMind (TechCrunch AI, CNBC Tech). This integration signifies a crucial step in Google's long-term vision for robotics, strengthening its foundational capabilities in physical AI and potentially accelerating the development of advanced robotic solutions for industrial automation and beyond. Google's ambition for Intrinsic is significant, with leadership reportedly envisioning the company as the 'Android of robotics,' aiming to provide a standardized, accessible software platform for the industry (CNBC Tech). This internal unification is set to streamline innovation, moving cutting-edge AI research closer to real-world applications within Google's operational framework. In a related development showcasing Alphabet's wider ambitions in physical AI, its autonomous driving subsidiary Waymo recently announced the expansion of its robotaxi service to 'select riders' in four new major U.S. cities: Houston, Dallas, San Antonio, and Orlando (CNBC Tech). This expansion highlights the accelerating deployment of AI in real-world autonomous systems. Concurrently, the global race in autonomous AI continues to draw significant investment, as evidenced by European AI driverless car start-up Wayve raising a substantial $1.2 billion, underscoring the intense market focus on this transformative technology (NYT Tech). This global competition is further highlighted by Chinese tech company Honor, which recently showcased a smartphone with an integrated robotic camera arm and teased plans for a humanoid robot, demonstrating diverse approaches to integrating AI and robotics into consumer products and beyond (CNBC Tech). Beyond these applied advancements, Google's commitment to foundational AI research remains robust. Google AI recently unveiled STATIC, a sparse matrix framework designed to deliver significantly faster constrained decoding for LLM-based generative retrieval, boasting up to a 948x speed improvement (MarkTechPost). Similarly, Google DeepMind introduced Unified Latents (UL), a new machine learning framework that enhances latent space regularization using diffusion priors and decoders, pushing the boundaries of generative model capabilities (MarkTechPost).
This comprehensive update package underscores Google's aggressive strategy to embed advanced AI across its product portfolio. From intuitive mobile assistants and powerful creative tools to foundational robotics and cutting-edge research, these developments empower users with more sophisticated AI capabilities and solidify Google's competitive stance in the rapidly evolving artificial intelligence landscape.
Sources
Weekly AI Newsletter
Trends, new tools, and exclusive analyses delivered weekly.