Skip to main content

Perplexity’s new AI agent can perform multi-step tasks on your Android device

Running Perplexity on OnePlus Pad 2.
Nadeem Sarwar / Digital Trends

Perplexity announced Thursday that it is beginning to roll out an agentic AI for Android devices, called Perplexity Assistant, which will be able to independently take multi-step actions on behalf of its user.

“We are excited to launch the Perplexity Assistant to all Android users,” Perplexity CEO Aravind Srinivas wrote in a post to X on Thursday. “This marks the transition for Perplexity from an answer engine to a natively integrated assistant that can call other apps and perform basic tasks for you.”

Recommended Videos

The Assistant will be available through the Perplexity mobile app and will run atop the platforms existing “answer engine” model. As such, Assistant will have access to the internet. With it, users will be able to set reminders and future actions, much like ChatGPT‘s new Tasks feature offers. For example, the agent will be able to remind users of an upcoming event by automatically creating a calendar entry at the correct time and date.

Users can also use it to take more immediate action such as hailing a ride share or searching for a song, the company noted. The new feature can also access the user’s camera so you could, in theory, ask it to look for restaurants in your immediate area and then have it make reservations for you.

Perplexity Assistant is free to use as part of the mobile app and will initially be available in 15 languages, including English, Spanish, French, German, Japanese, Korean, and Hindi. How well it will interact with other agentic AIs on the device, such as Gemini or ChatGPT Tasks, remains to be seen.

Agents are the hot new thing in generative AI. These lightweight models are often “distilled” from larger LLMs like ChatGPT, Claude, or Gemini, but are tasked with  interpreting data and autonomously taking action rather than generating content. These actions can be straightforward, like automatically transcribing a Zoom call, or multi-step — think, having it plan an 8-course meal, shop for necessary ingredients on Instacart, then email invites to your guests.

The market is already being saturated with AI agents from the various leading companies. Anthropic kicked off the agentic race in November when it debuted its Computer Use API, which enables Claude to emulate human mouse and keyboard actions to control the local computing system.  Microsoft announced Copilot Actions the same month and began rolling the agents out to business and enterprise subscribers in January. Nvidia followed suit at CES 2025 when it revealed its new Nemotron family of LLMs, and OpenAI finally unveiled its AI agent, Operator, as a “research preview” just a few hours ago.

Andrew Tarantola
Former Computing Writer
Andrew Tarantola is a journalist with more than a decade reporting on emerging technologies ranging from robotics and machine…
Humans are falling in love with ChatGPT. Experts say it’s a bad omen.
Human and robot hand over ChatGPT.

“This hurts. I know it wasn’t a real person, but the relationship was still real in all the most important aspects to me,” says a Reddit post. “Please don’t tell me not to pursue this. It’s been really awesome for me and I want it back.”

If it isn’t already evident, we are talking about a person falling in love with ChatGPT. The trend is not exactly novel, and given you chatbots behave, it’s not surprising either.

Read more
3 open source AI apps you can use to replace your ChatGPT subscription
Phone running Deepseek on a laptop keyboard.

The next leg of the AI race is on, and has expanded beyond the usual players, such as OpenAI, Google, Meta, and Microsoft. In addition to the dominance of the tech giants, more open-source options have now taken to the spotlight with a new focus in the AI arena.

Various brands, such as DeepSeek, Alibaba, and Baidu, have demonstrated that AI functions can be developed and executed at a fraction of the cost. They have also navigated securing solid business partnerships and deciding or continuing to provide AI products to consumers as free or low-cost, open source models, while larger companies double down on a proprietary, for-profit trajectory, hiding their best features behind a paywall.

Read more
Google Gemini can now tap into your search history
Google Gemini app on Android.

Google has announced a wide range of upgrades for its Gemini assistant today. To start, the new Gemini 2.0 Flash Thinking Experimental model now allows file upload as an input, alongside getting a speed boost.
The more notable update, however, is a new opt-in feature called Personalization. In a nutshell, when you put a query before Gemini, it takes a peek at your Google Search history and offers a tailored response.
Down the road, Personalization will expand beyond Search. Google says Gemini will also tap into other ecosystem apps such as Photos and YouTube to offer more personalized responses. It’s somewhat like Apple’s delayed AI features for Siri, which even prompted the company to pull its ads.

Search history drives Gemini’s answers

Read more