Extract Content from PDF (OCR)
With the Text To Table Converter Add-On, you can perform Optical Character Recognition (OCR) on PDF files to extract all content—including text, paragraphs, lists, tables, and mathematical formulas—and insert it as fully editable elements directly into your Google Docs™, Google Slides™, or Google Sheets™ document.
Extract Content from PDF
This feature analyzes the layout and structure of each PDF page, intelligently converting visual elements into editable content while preserving the original formatting as Markdown.
-
Open the Extract Content from PDF tool Navigate through the Google Workspace™ menu:
Extensions
>Text To Table Converter
>🪄 PDF Tools
>Extract Content from PDF
. -
Select a PDF File The tool will open, prompting you to select a file. You can choose a PDF from your Google Drive™ or upload one directly from your computer.
-
Choose Pages to Extract Once a PDF is loaded, the tool will display a grid of thumbnail previews for every page in the document.
- Click on up to 3 pages you wish to extract content from. Selected pages will be highlighted.
- To see a larger preview of a page, double-click its thumbnail.
-
Extract the Content After selecting your pages, click the Extract Page(s) button. The add-on will process each selected page individually and insert the extracted content into your active document.
- The AI automatically identifies and converts all content types, including paragraphs, lists, and tables.
- Basic formatting such as bold, italics, and
code
is preserved.
Special Feature: LaTeX Formula Extraction
A key feature of the PDF extractor is its ability to recognize mathematical and scientific formulas and automatically convert them into standard LaTeX notation (e.g., $$E=mc^2$$
).
This plain text notation can then be instantly rendered into a high-quality equation image using the add-on’s built-in LaTeX tools, creating a seamless workflow from PDF to a perfectly formatted document.