Skip to content

How to Set Up Automatic Document Ingestion via Email

Many devices (scanners, printers, fax services, and apps) can send documents as email attachments. This guide explains how to automatically route those attachments into DocuElevate.


Overview

The Email Ingestion workflow works like this:

Scanner/Device → Email (SMTP) → Monitored Mailbox → DocuElevate API → Processing & Storage

DocuElevate periodically checks a designated email inbox, downloads PDF/image attachments, and processes them through the standard document pipeline.


Prerequisites

  • An email account dedicated to document ingestion (e.g., scan@yourdomain.com)
  • IMAP access enabled for that account
  • DocuElevate running with the Celery worker active

Configuration

Add the following to your DocuElevate .env file:

# Email ingestion settings
EMAIL_INGESTION_ENABLED=true
EMAIL_INGESTION_IMAP_HOST=mail.yourdomain.com
EMAIL_INGESTION_IMAP_PORT=993
EMAIL_INGESTION_IMAP_SSL=true
EMAIL_INGESTION_USERNAME=scan@yourdomain.com
# Use an app-specific password (Gmail, Outlook) – NOT your main account password.
# See the Security Considerations section below for details.
EMAIL_INGESTION_PASSWORD=your-app-specific-password
EMAIL_INGESTION_FOLDER=INBOX
EMAIL_INGESTION_INTERVAL=60           # Check every 60 seconds
EMAIL_INGESTION_MARK_SEEN=true        # Mark emails as read after processing
EMAIL_INGESTION_ALLOWED_SENDERS=      # Comma-separated allowlist (empty = allow all)

Restart DocuElevate after saving the configuration:

docker compose restart api worker

Supported File Types

DocuElevate will process the following attachment types from emails:

Type Extension Notes
PDF .pdf Native support; OCR applied if not searchable
JPEG/PNG .jpg, .jpeg, .png Converted to PDF before processing
TIFF .tif, .tiff Common format from older scanners/fax
Multi-page TIFF .tif Full multi-page support

Controlling Which Attachment Types Are Ingested

By default, DocuElevate only ingests document attachments (PDFs, Word, Excel, PowerPoint, OpenDocument, RTF, TXT, CSV, HTML, Markdown). Images are not ingested by default — this prevents cluttering your document archive with inline images or unrelated photo attachments.

Global Default (Admin Setting)

Set the IMAP_ATTACHMENT_FILTER environment variable to control the system-wide fallback when no ingestion profile is assigned to a mailbox:

Value Behaviour
documents_only (Default) Only PDFs and office/document files. Images are skipped.
all All supported file types, including images.
IMAP_ATTACHMENT_FILTER=documents_only

Ingestion Profiles (Fine-Grained Per-Mailbox Control)

For precise control, you can create Ingestion Profiles that let you pick exactly which file-type categories to accept from each mailbox. This is more powerful than the binary global toggle and works independently per mailbox.

Available categories:

Category File types included
PDF .pdf
Microsoft Office .doc, .docx, .xls, .xlsx, .ppt, .pptx, and macro-enabled variants
OpenDocument .odt, .ods, .odp, .odg, .odf (LibreOffice / OpenOffice)
Text & Data .txt, .csv, .rtf
Web & Markup .html, .htm, .md, .markdown
Images .jpg, .png, .gif, .bmp, .tiff, .webp, .svg

Managing profiles:

  1. Go to Email Ingestion (/imap-accounts)
  2. Click Manage profiles (or the + icon next to the profile dropdown)
  3. Create a new profile, give it a name, and tick the categories you want
  4. When adding or editing a mailbox, select your profile from the dropdown

Two built-in profiles are always available and cannot be deleted:

  • Documents Only — PDF, Office, OpenDocument, Text, Web (no images)
  • All Files — all categories including images

Users can also create unlimited custom profiles to mix and match exactly the categories they need per mailbox (e.g. a scanner mailbox that only accepts PDFs, or a finance mailbox that accepts Office and CSV but not images).

Custom profiles are created via the UI or the /api/imap-profiles/ API.


Setting Up Your Scanner/Device

HP Printers – Scan to Email

See the detailed guide: HP Enterprise Printer Setup

Fujitsu ScanSnap – Send by Email

See the detailed guide: ScanSnap Setup

iOS/Android Scanning Apps

Most mobile scanning apps (Adobe Scan, Microsoft Lens, SwiftScan) can email scans:

  1. Scan your document.
  2. Use the app's Share or Send function.
  3. Select Email and enter scan@yourdomain.com.
  4. DocuElevate will pick up the attachment within the configured interval.

Fax-to-Email Services

Services like eFax, RingCentral Fax, or Twilio Fax can forward incoming faxes as email attachments. Configure them to send to scan@yourdomain.com.


Security Considerations

Important: Only process emails from trusted sources to avoid ingesting malicious documents.

Use the EMAIL_INGESTION_ALLOWED_SENDERS setting to restrict which email addresses can submit documents:

EMAIL_INGESTION_ALLOWED_SENDERS=scanner@office.com,printer@office.com,fax@office.com

DocuElevate will silently skip emails from addresses not in the allowlist.

Additionally:

  • Use a dedicated email account solely for document ingestion
  • Enable app-specific passwords (Gmail, Outlook) instead of your main account password
  • Store credentials in environment variables, never in config files committed to version control

Monitoring

Check the DocuElevate worker logs to verify email ingestion is running:

docker logs document_worker --follow

You should see log entries like:

INFO  Email ingestion: checking inbox scan@yourdomain.com
INFO  Email ingestion: found 3 new messages
INFO  Email ingestion: processing attachment invoice-2024.pdf from printer@office.com
INFO  Email ingestion: queued document ID 142 for processing

Troubleshooting

No emails are being processed? → Verify IMAP credentials and that IMAP is enabled on your mail server.
→ Check firewall rules: port 993 (SSL) or 143 (plain) must be open from DocuElevate to the mail server.

Gmail not working? → Enable "App Passwords" in your Google Account security settings.
→ Use the App Password (not your main Google password) for EMAIL_INGESTION_PASSWORD.

Attachments processed but files are empty? → Some email clients send inline images instead of attachments. Check the raw email source.

Emails keep getting re-processed? → Set EMAIL_INGESTION_MARK_SEEN=true to mark emails as read after processing.
→ Alternatively, configure a separate ingestion folder and move/delete emails after processing.