
The Voice AI That Actually Helps


Nahid's Notebook

I share simple, practical tips on AI and AI agents to help creators and businesses work smarter every day.

Episode 16:
Your AI Voice Assistant

nahiddotai

Wed Sep 3rd

Hey friends,

Picture this: It’s 7 AM, I’m on my usual morning walk, and I’m having the most productive conversation of my day.

Not with a human. With AI.

I pull out my phone, tap ChatGPT, and start talking through my latest newsletter ideas while the world wakes up around me.

It responds instantly and helps me refine concepts in real time, all while sounding perfectly human.

It gets better: Last year while travelling in Bali, I realized this wasn’t just convenient, it was magical.

All through natural conversation with an AI that actually understood what I needed.

This wasn’t Siri struggling with my accent or Google Assistant giving me robotic responses.

This was ChatGPT’s Advanced Voice Mode — and it’s completely changed how I think about voice AI. And guess what?

I’m doing it all again, while travelling in Japan as you read this 🇯🇵

Let me take you through it.


🎯 What Actually Makes Advanced Voice Mode Different?

Let me be blunt: This is what Siri was supposed to be all along.

Advanced Voice Mode combines speech, video, images, and screen sharing into a seamless, dynamic experience, creating conversations that feel genuinely natural.

Here’s what sets it apart:

Real-time visual understanding: Point your camera at anything and ask questions about what it sees, live.

Screen sharing capability: Share your screen and get instant help with whatever you’re looking at.

Multilingual: I spoke to it in Bengali (my second language). It responded flawlessly, maintaining context and cultural nuance.

Therapeutic conversations: Honestly, talking through ideas and problems feels surprisingly therapeutic.

No robotic responses. No “I didn’t understand that.”

Just natural, intelligent conversation.


🛠 4 Ways I’m Actually Using Advanced Voice Mode (That You Can Try Today)

Let me share the specific workflows that have become part of my routine:

Use Case 1: My Walking Think Buddy

Whenever I take a morning walk, I use it as my “brainstorming session.”

I open Advanced Voice Mode and talk through:

  • Content strategies
  • Problem-solving for projects
  • Newsletter ideas (like this one!)
  • Problems I’m facing that very day at work or personally

Pro tip: The AI remembers context throughout the conversation, so you can build on ideas naturally.

Use Case 2: Real-Time Tech Support

A few weeks ago, I was staring at my phone settings, completely stumped.

My iPhone notifications were acting weird. Some apps weren’t showing up, others were duplicating. I had no idea where to even start looking.

Instead of Googling endlessly, I:

  1. Opened Advanced Voice Mode
  2. Shared my phone screen showing the notification settings
  3. Talked through what was happening while navigating the menus

ChatGPT saw exactly what I was seeing and guided me step by step.

Problem solved in under 2 minutes.

It was like having a tech-savvy friend looking over my shoulder, but better.

Use Case 3: Travel Companion

In Bali, I discovered Advanced Voice Mode isn’t just a translator—it’s a complete travel companion.

I open Advanced Voice Mode in ChatGPT, point my phone’s camera at a sign, and ask: “What does this say, and what do I need to know before entering?”

Within seconds, it’s reading the sign aloud, explaining the dress code, telling me about the cultural significance, and even suggesting the proper way to show respect.

But it gets better:

  • Point at street food and get ingredient breakdowns
  • Ask about local customs just by showing your surroundings
  • “What’s that building?” gets you history, not just identification

Right now, as you read this, I’m using ChatGPT voice mode to get myself around Japan. The trains, the shopping, the signs: it’s all wild. I’ll share more on this when I’m back.

Use Case 4: Visual Problem Solving

Here’s where it gets super helpful:

Point your camera at anything:

  • A broken appliance
  • A confusing diagram
  • Even your messy desk

And ask for help. I literally fixed a broken hot water urn simply by pointing my camera at it and talking through the problem.

It sees what you see and responds with relevant, actionable advice.


🚀 Your Quick Start Guide (Set This Up in 2 Minutes)

Transform your phone into the assistant you’ve always wanted.

Here’s exactly how:

  1. Download the ChatGPT mobile app (if you haven’t already)
  2. Open the app and tap the voice icon in the chat bar (not the microphone button, the one on the far right). This opens a new screen with a circular orb in the middle and buttons along the bottom for video, images, and so on.
  3. Add the ChatGPT widget to your home screen AND lock screen for instant access (optional, but highly recommended)
  4. Choose your preferred voice from the nine available options

Pro tip: Free users now get a monthly preview of Advanced Voice Mode, so you can try it before committing to Plus.


🧠 Why This Matters More Than You Think

We’re at an inflection point.

Voice AI isn’t just getting better at understanding us.

It’s becoming genuinely helpful in ways we never imagined.

Think about it:

  • No more language barriers
  • No more typing long questions
  • No more switching between apps
  • No more waiting for “the right moment” to get help

The people who get comfortable with voice AI now will have a massive advantage.

Don’t be the person still typing when everyone else is talking.


🫡 Final Thought

Here’s something I didn’t expect:

Talking through problems out loud, even to an AI, is incredibly clarifying.

There’s something about verbalizing thoughts that makes complex ideas clearer, decisions easier, and stress more manageable.

It doesn’t replace human connection, but it’s remarkably effective for thinking out loud.

For years, we’ve been promised intelligent voice assistants.

Siri gave us timers and weather updates. Alexa gave us shopping lists and smart home controls.

But Advanced Voice Mode gives us something different: genuine conversation with an AI that can see, understand, and respond intelligently to our world.

This is more than just a feature.

It’s a preview of how we’ll interact with AI in the future.

Voice-first. Visual. Conversational. Helpful.

Get used to talking to AI now, because voice interaction is about to become as natural as texting.

Try Advanced Voice Mode this week and let me know what you discover.

Reply with the most surprising way you ended up using it.

Here’s to smarter conversations and better assistance,
Nahid
