Supported File Types
Phinder identifies PII in both the file contents and the file name.
| Category | Extensions | Notes |
|---|---|---|
| Plain Text | .txt, .md |
Markdown is supported. |
| Documents | .pdf, .docx, .doc, .rtf |
|
| Spreadsheets | .xlsx, .xls, .csv |
CSV delimiter and quoting can be customized. |
| Presentations | .pptx, .ppt |
Extracts from slides, notes, and comments. |
.eml, .msg |
Extracts subject, sender, and body. | |
| Logs | .log |
Processed line-by-line for memory efficiency. |
| Images | .png, .jpg, .jpeg |
Requires Tesseract OCR installed. |
Note
Processing images requires tesseract-ocr to be installed on your system.