Everything New in the Gemini App: Image, Video, Voice & Deep Research Now in Your Pocket



The Gemini app has taken a huge leap forward. Following the exciting announcements at Google I/O, Gemini is no longer just a smart chatbot — it’s your personal AI-powered assistant, video producer, image creator, researcher, code collaborator, and even a study buddy. Whether you're a student, professional, content creator, or developer, the new Gemini features are built to empower your workflow, creativity, and everyday tasks like never before.

Let’s break down everything that’s new in Gemini and how you can start using these features today.


1. Gemini Live with Camera and Screen Sharing is Now Free

Show it. Say it. Solve it.
Gemini Live is now available for free on both Android and iOS — and it's a game changer. Ever wished you could just show your phone screen or point your camera at something and get help instantly? With Gemini Live, now you can.

Gemini Live lets you:

  • Share your camera view or screen in real time

  • Troubleshoot complex tasks visually

  • Get hands-on help with anything from fixing a printer to comparing clothes while shopping

The feature is already making waves: conversations using Gemini Live are 5x longer than text-based ones, showing how much more effective it is for real-time problem-solving.

Even better — Google Calendar, Maps, Tasks, and Keep are being integrated. Soon, Gemini can help plan a night out, book events, or organize your tasks directly across your favorite apps.


2. Introducing Imagen 4 and Veo 3: Create Stunning Images and Videos

Creativity is getting a powerful upgrade with two of the most exciting AI models yet:

🎨 Imagen 4

Now built directly into the Gemini app, Imagen 4 is Google’s latest image generation model. It’s known for:

  • Hyper-realistic visuals

  • Improved text rendering and typography

  • Fast and responsive image generation

Perfect for marketers, designers, or anyone who needs to create striking visuals quickly — try generating event invites, social media posts, or even concept art on the fly.

🎬 Veo 3

Gemini also now features Veo 3, the next-gen video generation model. Available to Google AI Ultra subscribers in the U.S., Veo 3 allows users to:

  • Generate high-quality videos from text

  • Add native sound effects, background noise, and even character dialogue

  • Create immersive, cinematic-style content with minimal input

Think of Veo 3 as your personal video production studio, driven by your imagination.


3. Deep Research: Now with Personalized Sources

Gemini’s Deep Research just got deeper.

You can now upload your own PDFs, documents, and images, and Gemini will incorporate them into a comprehensive research report. This means you get a richer, personalized understanding by combining public data with your own insights.

Use cases include:

  • Market analysts blending internal sales data with external reports

  • Academics cross-referencing rare journal articles

  • Soon, Gemini will even support pulling content from Google Drive and Gmail

This upgrade allows users to spot hidden trends, make smarter decisions, and spend less time gathering information.


4. Create Anything in Gemini Canvas

Gemini Canvas is the AI-powered space where ideas become reality.

Whether you’re building a website, designing an infographic, or prototyping an app, Canvas now supports more creative output than ever, powered by the Gemini 2.5 Pro models. What you can do:

  • Build interactive quizzes, infographics, and audio overviews in 45+ languages

  • Convert simple descriptions into fully working code

  • Rapidly develop prototypes with intuitive, real-time feedback

Canvas is also perfect for non-coders who want to "vibe code" — just describe what you want, and Gemini writes and explains the code.


5. Gemini in Chrome: Ask While You Browse

Starting this week, Gemini is coming to Chrome on desktop for Google AI Pro and Ultra subscribers in the U.S.

With Gemini in Chrome, you can:

  • Ask questions about any page you’re reading

  • Summarize long content instantly

  • Clarify complex information with one click

Soon, Gemini will also be able to navigate tabs, cross-reference pages, and even take actions on websites — all on your behalf.


6. Interactive Quizzes: Smarter Studying Starts Now

Forget passive note-reading — Gemini is reinventing how you study. Starting today, you can ask Gemini to:

  • “Create a practice quiz on thermodynamics”

  • Get instant feedback with auto-generated quizzes

  • Receive follow-up quizzes on weaker areas

Gemini makes the learning experience more interactive, customized, and effective. And if you're a student in the U.S., Brazil, Indonesia, Japan, or the UK, you're eligible for a free school year of Gemini Pro.


7. New Plans: Google AI Pro and Google AI Ultra

To access the best of what Gemini has to offer, Google has introduced two new plans:

🔹 Google AI Pro – $19.99/month

The Pro plan replaces Gemini Advanced and includes:

  • Access to Gemini 1.5 Pro

  • Tools like Flow, NotebookLM

  • Higher rate limits and expanded capabilities

Perfect for professionals and creators who want more power and precision in their AI workflows.

🔸 Google AI Ultra – $249.99/month

The Ultra plan is designed for power users, enterprise teams, and early adopters. It includes:

  • Access to Gemini’s most powerful models, like Veo 3

  • Early access to experimental features

  • 2.5 Pro Deep Think mode coming soon

  • First access to Agent Mode (Gemini’s task orchestrator)

For your first three months, new users get 50% off.


8. Agent Mode (Coming Soon)

Agent Mode is one of Gemini’s most exciting upcoming features. It will allow you to set high-level goals and let Gemini execute complex, multi-step tasks for you.

Imagine:

  • “Plan and book a weekend trip to Yosemite”

  • “Create a website for my photography portfolio”

  • “Find and summarize the latest research on climate policy”

Agent Mode will use live browsing, app integrations, and intelligent planning to handle tasks end-to-end with minimal effort from you.


Final Thoughts: Gemini Is Becoming the Ultimate AI Assistant

From live video help to deep research, from stunning media generation to intelligent automation, the Gemini app is quickly becoming the most powerful AI tool available for everyday users.

Whether you're studying for exams, building a startup, managing a team, or creating art, Gemini is designed to meet you where you are — and help you go further.

✨ Try Gemini today at gemini.google.com


📌 Quick Summary of What's New in Gemini:

  • Gemini Live: Free real-time help via camera or screen sharing

  • Imagen 4: Hyper-realistic image generation

  • Veo 3: Video generation with sound and dialogue

  • Deep Research: Now supports personal PDFs/images

  • Canvas: Build interactive content and applications

  • Gemini in Chrome: Ask questions while browsing

  • Interactive Quizzes: Personalized learning support

  • New Plans: Google AI Pro ($19.99/mo) & Ultra ($249.99/mo)

  • Agent Mode: Intelligent task execution coming soon

Let your ideas take flight with Gemini.


If you'd like this formatted as a downloadable blog post or need a version for publishing to platforms like Medium or LinkedIn, let me know!

নবীনতর পূর্বতন

যোগাযোগ ফর্ম