Skip to main content

How to use ChatGPT to analyze PDFs

The chatGPT chat screen with an uploaded image, on an Acer, on a bench, on a deck.
Andrew Tarantola / Digital Trends

Thanks to its advanced vision capabilities, ChatGPT can provide you with in-depth analysis and summarization of images and documents alike. This can be especially handy when you have a research paper or legal documents spanning dozens of pdf pages. Why go through all that trouble to parse them yourself when you can simply have ChatGPT do it for you? In this guide, you'll see just how easy it is to upload PDFs to ChatGPT.

Recommended Videos

Difficulty

Easy

Duration

5 minutes

Upload a PDF to ChatGPT

Step 1: Log into ChatGPT. Open your web browser and navigate to ChatGPT.com and click the Sign In button in the lower-left corner. Enter your credentials as needed.

the chatgpt homescreen
Digital Trends

Step 2: Attach the PDF to the prompt window. Click the paperclip icon next to the text input field, select where you want the document sourced from — you can select from the local hard drive, Google Drive, or Microsoft OneDrive — and click the file you want to attach.

ChatGPT upload source
Digital Trends

Step 3: Enter your query. Once the PDF is attached, type your query, question, directions, or what have you into the prompt field. In this case, I'm asking the system how much more pork I'd need if I wanted to triple the serving size of the recipe. Click the upward-facing arrow button on the right edge of the prompt window to upload everything to ChatGPT's servers, and give the system time to do its analysis.

Chatgpt home screen with a PDF attached to the prompt field
Digital Trends
Andrew Tarantola
Andrew Tarantola is a journalist with more than a decade reporting on emerging technologies ranging from robotics and machine…
ChatGPT’s latest model may be a regression in performance
chatGPT on a phone on an encyclopedia

According to a new report from Artificial Analysis, OpenAI's flagship large language model for ChatGPT, GPT-4o, has significantly regressed in recent weeks, putting the state-of-the-art model's performance on par with the far smaller, and notably less capable, GPT-4o-mini model.

This analysis comes less than 24 hours after the company announced an upgrade for the GPT-4o model. "The model’s creative writing ability has leveled up–more natural, engaging, and tailored writing to improve relevance & readability," OpenAI wrote on X. "It’s also better at working with uploaded files, providing deeper insights & more thorough responses." Whether those claims continue to hold up is now being cast in doubt.

Read more
ChatGPT just improved its creative writing chops
a phone displaying the ChatGPT homepage on a beige bbackground.

One of the great strengths of ChatGPT is its ability to aid in creative writing. ChatGPT's latest large language model, GPT-4o, has received a bit of a performance boost, OpenAI announced Wednesday. Users can reportedly expect "more natural, engaging, and tailored writing to improve relevance & readability" moving forward.

https://twitter.com/OpenAI/status/1859296125947347164

Read more
ChatGPT already listens and speaks. Soon it may see as well
ChatGPT meets a dog

ChatGPT's Advanced Voice Mode, which allows users to converse with the chatbot in real time, could soon gain the gift of sight, according to code discovered in the platform's latest beta build. While OpenAI has not yet confirmed the specific release of the new feature, code in the ChatGPT v1.2024.317 beta build spotted by Android Authority suggests that the so-called "live camera" could be imminently forthcoming.

OpenAI had first shown off Advanced Voice Mode's vision capabilities for ChatGPT in May, when the feature was first launched in alpha. During a demo posted at the time, the system was able to identify that it was looking at a dog through the phone's camera feed, identify the dog based on past interactions, recognize the dog's ball, and associate the dog's relationship to the ball (i.e. playing fetch).

Read more