PDF to Excel Extractor – Local Edition
$5
$5
https://schema.org/InStock
usd
PM The Tech Guy
Instantly extract your documents' names, emails, and organizations securely and locally.
This tool utilizes spaCy's AI to retrieve structured data from PDF, DOCX, and TXT files, converting it into clean, ready-to-use Excel spreadsheets. No cloud processing. No API keys.
What This Tool Does:
- Scans all files in your
input/
folder - Extracts:
- Names (PERSON)
- Organizations (ORG)
- Email addresses
- Generates a
.xlsx
spreadsheet in theoutput/
folder - Works offline — no data leaves your computer
Requirements:
- Python 3.8 to 3.11
- Basic understanding of how to run a Python script
What's Included:
- Fully structured Python app (
main.py
+ utils) - Clean and editable folder setup
- PDF README with install and usage instructions
- Support via: support@pmthetechguy.com
Note: This tool employs spaCy's general-purpose English model. It performs effectively for most documents but may not accurately identify all entities. A future professional version might incorporate improved models.
A local Python tool that extracts names, emails, and organizations from PDFs into Excel, powered by spaCy.
Local-Only File Processing
No need to upload your files to the cloud. This tool runs entirely on your computer for maximum privacy and control.
Clean Excel Output
Your extracted data is saved as a structured .xlsx file with separate columns for Names, Emails, and Organizations.
Beginner-Friendly Setup
Includes a step-by-step PDF guide for installation and usage. Just drop files in the input folder and run main.py.
Size
299 KB
Add to wishlist