Project Astra – An AI Agent That Can Stay Both in Your Pocket & Glasses

Srishti Panwar

Have you ever thought about what it would be like to have an assistant all to yourself – one who can talk to you, get you answers in real time, and literally be on their toes to cater to your needs 24/7? 🕰️

Well, a human assistant might not be possible, but it looks like Google has something just as close for you, and for free!

No, it’s not the typical Siri or Alexa kind of assistant; this one is unique. As Demis Hassabis, Co-founder and CEO of DeepMind, would say, it is the future of AI assistants.

For a long time, we have been aiming to build a universal AI agent that can be truly helpful in everyday life.

Demis Hassabis, Co-Founder & CEO – DeepMind


For Google, bringing AI to life isn’t a vision it thought of yesterday; it has been in the works for over a decade. Gemini, which has been multimodal from the very beginning, is a result of that.

Gemini was a transformative experience in itself, but with Project Astra we are moving a step further and, going by the little that has been shared about it so far, hopefully entering a new era of AI assistants.

Demis Hassabis unveils Project Astra and its vision at Google I/O ’24. [Source – Google YouTube]

What is Project Astra? 

Astra (अस्त्र) is a Sanskrit word from Hindu mythology meaning ‘supernatural weapon.’ Google itself expands the name as ‘advanced seeing and talking responsive agent,’ but I can see the effect the name has either way, and looking at the tech, it seems quite fitting for the features of this AI assistant.

“An agent like this has to respond to our complex, dynamic, and ever-changing world, just like we do,” said Hassabis at Google I/O ’24. He added that it would need to take in and remember what it has seen so it can understand the context and take action.

To do so, it would have to be proactive, teachable, and personal, so you can talk to it naturally without any delay or lag. 

That’s exactly what Project Astra aims to do and has already done to some degree. 

In simple words, Project Astra is a universal AI agent built by Google that can answer your queries in real time based on the images and video it sees, and can also remember things it has seen in its surroundings or talked about.

Gemini Project Astra is ‘almost’ human 🫀

Built on Google’s Gemini models, Project Astra is more or less an extension of Gemini, or what you could call a ‘more human’ Gemini.

Whilst it is still an AI technology talking, its advanced features can also give you a false sense of companionship, which might be fortunate or unfortunate – we are yet to find out. 

Till then, here is all you need to know about this AI assistant.

Just like a weapon, it makes you feel equipped 

Project Astra lives up to its name and is different from existing AI and voice assistants, as stated by the journalist Anuj Bhatia, who got to give it a try.

“Astra’s capabilities go beyond what we have seen in existing AI assistants. The Google researchers told me that Astra uses built-in “memory,” meaning after it scans the objects, it could still “remember” where specific items were placed.”

Anuj Bhatia, journalist 


Along with answering your queries, it can also remember things for you. While Astra’s memory window is very small right now, it is still helpful, and if it gets expanded, the possibilities are insane. You probably won’t have to worry about where your keys are.
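For the curious, here is a tiny Python sketch of what a small "memory window" could look like in principle. To be clear, Google has not published how Astra’s memory actually works, so this is purely my own illustration: the `RollingMemory` class, its methods, and the window size are all made up for the example.

```python
from collections import deque
from typing import Optional


class RollingMemory:
    """Toy bounded memory (hypothetical): keeps only the most recent sightings."""

    def __init__(self, window: int) -> None:
        # deque(maxlen=...) drops the oldest entry automatically once it is full.
        self._seen = deque(maxlen=window)

    def observe(self, item: str, place: str) -> None:
        # Record where an item was last seen.
        self._seen.append((item, place))

    def where_is(self, item: str) -> Optional[str]:
        # Search newest-first so the latest sighting wins.
        for seen_item, place in reversed(self._seen):
            if seen_item == item:
                return place
        return None  # never seen, or already fell out of the window


memory = RollingMemory(window=3)
memory.observe("keys", "on the desk")
memory.observe("glasses", "next to the apple")
memory.observe("mug", "by the window")
print(memory.where_is("keys"))   # on the desk

memory.observe("phone", "on the couch")  # window is full, so "keys" is forgotten
print(memory.where_is("keys"))   # None
```

With a window of three, the fourth sighting pushes the keys out of memory, which is why a bigger window would matter so much in practice.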

It can stay in your phone and also glasses 

Once Astra is available, it will see the world with you, just as you see it, and learn from it. This is an AI assistant that you can access not just from your phone but also through smart glasses.

Sounds hard to believe? Have a look at Google showing off Astra through a pair of smart glasses in its demo.

Project Astra being used through smart glasses to answer a question about what is on the whiteboard. [Source – Google YouTube]

What can Project Astra do for you? 

The better question is, what can AI not do for you?

Astra is based on some of the most advanced AI tech, which includes generative AI, multimodal AI, and computer vision.

Through your phone’s camera or your glasses, it captures what is in front of it, uses computer vision to interpret the scene, and then applies multimodal and generative AI to answer your questions.

As simple as it sounds right now, the tech in the back end is quite the opposite. But this is what you need to know about how it functions at the surface level.

Keeping that in mind, here are all the things Astra can do for you; some of them surprised me as well.

Astra can even understand the concepts of quantum physics through a roughly drawn image. [Source – Google Youtube]

Explain Physics Drawings

As shocking as it might sound, Astra is capable of understanding even poorly drawn images related to physics, be it a roughly drawn Albert Einstein or a sketch describing the application of gravity.

It will be able to recognize those and answer accordingly as well.

I wish I had access to Astra in high school!

Can solve mathematical problems

Astra sounds like a real geek because it can solve not just physics but mathematics problems as well, and within seconds at that. From questions involving factorization to reading and interpreting graphs, Astra can really do it all.

Recognize things/landmarks/monuments/literature from drawings

Your poorly drawn sketches will not be scrutinized or made fun of for being unrecognizable. 

You know why? 

Because even if your Taj Mahal does not have neatly drawn edges, or your zebra looks more like a stick figure without structure, Astra would be able to figure out what it is. You have got a cheerleader for yourself.

That’s not all: from your sketches, it can also make out which work of literature or story you are referring to. Even your stick figures of Romeo and Juliet with a poison bottle and a dagger would be recognized by Astra as Shakespeare’s famous play.

Tell your location through the surroundings 

Among all the things that Astra can do, the most surprising one was that it can tell your location just by looking at your surroundings and where you are standing. That is brilliant not just in terms of how advanced it is but also from a safety standpoint.

You will know which neighborhood you are in even if it is the most random street with buildings. That might be hyperbole, but from what we see right now, it feels real and very possible.

Project Astra can tell where you are located just based on images of your surroundings. [Source – Google YouTube]

It can converse naturally 

Many journalists who got a demo of Astra, including Anuj Bhatia and Kerry Wan, say that it is far from human, but they also recognize that this is the ‘most human’ AI we have got so far, which I believe is a huge leap from generative AI tools like ChatGPT.

I agree that it is not a human, and it can never be, but I also acknowledge everything that it brings to the table as an AI, chief among them being its tone and way of talking.

“Perhaps more notable was how natural the AI sounded as I panned the Pixel 8 Pro camera around and asked random questions about various objects in the room. The natural-sounding voice goes hand in hand with the Storyteller and Pictionary capabilities, both of which keep children, students, and people who have time to spare entertained.”

Kerry Wan, Senior Reviews Editor at ZDNET


While Astra’s conversational skills are earning it some brownie points, let’s not forget that its response time is praiseworthy as well.

DeepMind’s team shared that engineering something to be conversational is a huge challenge. Keeping that in mind, it is commendable how talking to Project Astra feels like talking to a peer. The response time is noticeably better than that of other AI tools, and you can feel the conversation flowing.

This is the tech behind the conversational nature of Project Astra. It was tough, but Google DeepMind’s team made it happen. [Source – DeepMind]

Project Astra AI agent is just the beginning 🌌🚀

Demis Hassabis gearing up with Project Astra before the Google I/O ’24 event.

“I think you will agree it’s amazing to see how far AI has come especially when it comes to spatial understanding, video processing, and memory. It is easy to envision a future where you can have an expert by your side, through your phone.”

Demis Hassabis, Co-Founder & CEO – DeepMind

While Project Astra is an exciting venture that we are all looking forward to, it is also just the beginning. It definitely has insane potential, but there are some rough edges that still need to be smoothed out for it to make it in the real world.
Either way, I am looking forward to getting my hands on it and making AI work for me.


Srishti Panwar

I am a Founding Member and Head of Content at Unrola. I am a writer during the day and a reader at night, and if I am not doing either then you can find me working out.