Skip to main content

Perplexity’s new AI agent can perform multi-step tasks on your Android device

Running Perplexity on OnePlus Pad 2.
Nadeem Sarwar / Digital Trends

Perplexity announced Thursday that it is beginning to roll out an agentic AI for Android devices, called Perplexity Assistant, which will be able to independently take multi-step actions on behalf of its user.

“We are excited to launch the Perplexity Assistant to all Android users,” Perplexity CEO Aravind Srinivas wrote in a post to X on Thursday. “This marks the transition for Perplexity from an answer engine to a natively integrated assistant that can call other apps and perform basic tasks for you.”

Recommended Videos

The Assistant will be available through the Perplexity mobile app and will run atop the platforms existing “answer engine” model. As such, Assistant will have access to the internet. With it, users will be able to set reminders and future actions, much like ChatGPT‘s new Tasks feature offers. For example, the agent will be able to remind users of an upcoming event by automatically creating a calendar entry at the correct time and date.

Users can also use it to take more immediate action such as hailing a ride share or searching for a song, the company noted. The new feature can also access the user’s camera so you could, in theory, ask it to look for restaurants in your immediate area and then have it make reservations for you.

Perplexity Assistant is free to use as part of the mobile app and will initially be available in 15 languages, including English, Spanish, French, German, Japanese, Korean, and Hindi. How well it will interact with other agentic AIs on the device, such as Gemini or ChatGPT Tasks, remains to be seen.

Agents are the hot new thing in generative AI. These lightweight models are often “distilled” from larger LLMs like ChatGPT, Claude, or Gemini, but are tasked with  interpreting data and autonomously taking action rather than generating content. These actions can be straightforward, like automatically transcribing a Zoom call, or multi-step — think, having it plan an 8-course meal, shop for necessary ingredients on Instacart, then email invites to your guests.

The market is already being saturated with AI agents from the various leading companies. Anthropic kicked off the agentic race in November when it debuted its Computer Use API, which enables Claude to emulate human mouse and keyboard actions to control the local computing system.  Microsoft announced Copilot Actions the same month and began rolling the agents out to business and enterprise subscribers in January. Nvidia followed suit at CES 2025 when it revealed its new Nemotron family of LLMs, and OpenAI finally unveiled its AI agent, Operator, as a “research preview” just a few hours ago.

Andrew Tarantola
Andrew Tarantola is a journalist with more than a decade reporting on emerging technologies ranging from robotics and machine…
OpenAI is releasing an AI that can control your PC — if you cough up $200
The ChatGPT name next to an OpenAI logo on a black and white background.

OpenAI may be one step closer to releasing its agent tool, called Operator, which is on track for January 2024 availability.

The artificial intelligence company first announced the Operator AI agent in November 2024, explaining that the browser-based tool is autonomous and is able to complete tasks on a computer without human assistance. OpenAI added that Operator would be first available as a research preview within the $200 ChatGPT Pro subscription plan.

Read more
Everything you need to know about AI agents and what they can do
a hollow man under light

The agentic era of artificial intelligence has arrived. Billed as "the next big thing in AI research," AI agents are capable of operating independently and without continuous, direct oversight, while collaborating with users to automate monotonous tasks. In this guide, you'll find everything you need to know about how AI agents are designed, what they can do, what they're capable of, and whether they can be trusted to act on your behalf.
What is an agentic AI?
Agentic AI is a type of generative AI model that can act autonomously, make decisions, and take actions towards complex goals without direct human intervention. These systems are able to interpret changing conditions in real-time and react accordingly, rather than rotely following predefined rules or instructions. Based on the same large language models that drive popular chatbots like ChatGPT, Claude, or Gemini, agentic AIs differ in that they use LLMs to take action on a user's behalf rather than generate content.

AutoGPT and BabyAGI are two of the earliest examples of AI agents, as they were able to solve reasonably complex queries with minimal oversight. AI agents are considered to be an early step towards achieving artificial general intelligence (AGI). In a recent blog post, OpenAI CEO Sam Altman argued that, “We are now confident we know how to build AGI as we have traditionally understood it,” and predicted, "in 2025, we may see the first AI agents ‘join the workforce’ and materially change the output of companies.”

Read more
Microsoft introduces new ‘pay-as-you-go’ AI agents
microsoft copilot introduce ai agents free enterprise subscription tier m365 465350 blog 250110 1 1260

Microsoft will begin offering access to AI agents — specialized generative models that can operate independently and automate repetitive daily tasks — to enterprise users. The new program is called Microsoft 365 Copilot Chat and offers "pay-as-you-go agents to our existing free chat experience for Microsoft 365 commercial customers," the company announced Wednesday.

The "free plus metered agent usage" Microsoft 365 Copilot Chat offers many of the same features as the existing $30 per user per month "Microsoft 365 Copilot" enterprise program, including access to a chatbot powered by GPT-4o, Copilot Pages, file uploads, image and code generation, enterprise data protection, and, of course, to Copilot Studio, where individual users and IT departments alike can create AI agents. Note, however, that the free Chat program does not grant you access to the Copilot personal assistant, which integrates the AI's capabilities into the rest of the 365 Copilot app ecosystem such as Word, Outlook, and Excel.

Read more