DocTo
Simple command line utility for converting .doc & .xls files to any supported format such as Text, RTF, CSV or PDF
Document Converter
Simple utility for converting Microsoft Word '.doc', Microsoft Excel '.xls' and Microsoft PowerPoint '.ppt' files to any other supported format, such as .txt, .csv, .rtf or .pdf.
It can also convert .txt, .rtf and .csv files to .doc, .xls or .pdf format.
It can be used to convert older Word documents to the latest format.
Microsoft Word, Excel or PowerPoint must be installed on the host machine.
Download the release from GitHub Releases: https://github.com/tobya/DocTo/releases/
Further information is available at https://tobya.github.io/DocTo/
Features
- Convert Doc/RTF/Text file to any Word SaveAs Type Doc/Text/RTF/PDF
- Convert XLS/XLSX/CSV file to any Excel SaveAs Type CSV/Text/PDF
- Convert Text/CSV file to full-fledged Word or Excel format.
- Single File Conversion
- Multiple / Directory File Conversion.
- Delete after conversion
- Fire https Webhook on each conversion.
Examples
More Examples available at
Installation
Download .exe from Release https://github.com/tobya/docTo/releases
Package Managers
Choco
Also available for installation via Chocolatey:
choco install docto
To upgrade to the latest version:
choco upgrade docto
Contributors
I am actively looking for contributors, so if you are a
- Delphi (or VBA) programmer
- PHP, Laravel, Pest Programmer (tests and documentation)
and interested in helping out, please send me a message or comment.
Docto.Works Conversion Website
I have recently launched Docto.Works, which uses DocTo to let anyone, anywhere, convert Word documents to PDF, HTML and text files. I will be adding functionality over time, but wanted to release it with 1.16. Conversions use a token system, and you get 250 tokens on signup, so start using it and let me know what you think in the discussion.
DocTo in other applications
Node
Node wrappers have been created by @KerimG & @brrd:
https://www.npmjs.com/package/node-docto
https://github.com/brrd/msoconvert
@opendevise/antora-office-to-pdf-extension
Other Integrations
Bugs and Features
Please log an issue for any bugs, features or suggestions.
Examples
Single
Convert Microsoft Word Document to text
docto -WD -f C:\Directory\MyFile.doc -O "C:\Output Directory\MyTextFile.txt" -T wdFormatText
Convert Microsoft Excel Document to csv text
docto -XL -f C:\Directory\MyFile.xls -O "C:\Output Directory\MyTextFile.csv" -T xlCSV
Convert Microsoft Word Document to PDF (requires a version of Microsoft Word that supports this).
docto -f C:\Directory\MyFile.doc -O "C:\Output Directory\MyTextFile.pdf" -T wdFormatPDF
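When DocTo is called from a script rather than typed at a prompt, the single-file commands above can be assembled as an argument list, which avoids hand-quoting paths that contain spaces. The following is a minimal sketch, not part of DocTo itself: the helper names `build_docto_args` and `convert` are hypothetical, and it assumes `docto` is on the PATH. It uses only the `-WD`/`-XL`, `-f`, `-O` and `-T` parameters shown in the examples.

```python
import subprocess
from typing import List, Optional


def build_docto_args(input_path: str, output_path: str, fmt: str,
                     app_flag: Optional[str] = None,
                     exe: str = "docto") -> List[str]:
    """Build a docto argument list (hypothetical helper, not part of DocTo).

    app_flag is one of the application switches from the help text,
    e.g. "-WD" or "-XL"; when omitted, DocTo defaults to Word.
    """
    args = [exe]
    if app_flag:
        args.append(app_flag)
    # -f input, -O output, -T format: the same parameters as the examples above.
    args += ["-f", input_path, "-O", output_path, "-T", fmt]
    return args


def convert(input_path: str, output_path: str, fmt: str,
            app_flag: Optional[str] = None) -> None:
    """Run docto; assumes the executable is on the PATH."""
    # Passing a list lets subprocess quote paths with spaces correctly.
    subprocess.run(build_docto_args(input_path, output_path, fmt, app_flag),
                   check=True)
```

For example, `convert(r"C:\Directory\MyFile.xls", r"C:\Output Directory\MyTextFile.csv", "xlCSV", "-XL")` would run the Excel example above. `check=True` raises `CalledProcessError` on a non-zero exit code; whether DocTo signals every failure that way is not stated here, so treat that as an assumption.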
Multiple Files and Folders
Convert All Microsoft Word Documents in Directory and its Sub Directories to PDF
docto -f "C:\Dir with Spaces\FilesToConvert\" -O "C:\DirToOutput" -T wdFormatPDF -OX .pdf
Delete Original File after Conversion
Delete the original files after a successful conversion (-R).
docto -f "C:\Dir with Spaces\FilesToConvert\" -O "C:\DirToOutput" -T wdFormatPDF -OX .pdf -R true
Webhooks
Add a Webhook to fire on each conversion (-W)
docto -f "C:\Dir with Spaces\FilesToConvert\" -O "C:\DirToOutput" -T wdFormatPDF -OX .pdf -W https://toflidium.com/webhooks/docto/webhook_test.php
A Webhook is a URL that is called on each conversion, giving you the ability to respond externally whenever a file is converted.
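To see what a webhook call looks like on the receiving side, a small endpoint that records each incoming request can help. This README does not describe the request method or parameters DocTo sends (see help '-H -HW'), so the sketch below makes no assumption about them: it accepts GET and POST and prints whatever path, query string and body arrive. The names `summarize_request`, `WebhookHandler` and `serve` are hypothetical, and the server is plain HTTP, so it would need to sit behind an HTTPS endpoint to match the https webhook feature listed above.

```python
from http.server import BaseHTTPRequestHandler, HTTPServer
from urllib.parse import parse_qs, urlsplit


def summarize_request(raw_path: str):
    """Split a request target into its path and a dict of query parameters."""
    parts = urlsplit(raw_path)
    return parts.path, parse_qs(parts.query)


class WebhookHandler(BaseHTTPRequestHandler):
    """Logs every webhook call; the payload format is not assumed."""

    def _handle(self):
        length = int(self.headers.get("Content-Length") or 0)
        body = self.rfile.read(length) if length else b""
        path, params = summarize_request(self.path)
        # Print the method, path, query parameters and the start of any body.
        print(self.command, path, params, body[:200])
        self.send_response(200)
        self.end_headers()

    do_GET = _handle
    do_POST = _handle


def serve(port: int = 8080) -> None:
    """Listen on localhost until interrupted (the port is an arbitrary choice)."""
    HTTPServer(("127.0.0.1", port), WebhookHandler).serve_forever()
```

Calling `serve()` starts the listener. Pointing `-W` at it during a test batch shows which parameters each conversion actually reports.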
Use in the Wild
If you are using DocTo in the wild somewhere, please add details to this wiki page
OneDrive Conversion
If you need to upgrade a batch of files so that they work without conversion on OneDrive / Office365 / Word 20XX, you can use DocTo. See this wiki article:
https://github.com/tobya/DocTo/wiki/OneDrive-Conversion
Command Line Help
Help
DocTo Version: %s
Office Version: %s
Open Source: https://github.com/tobya/DocTo/
Description: DocTo converts Word Documents and Excel Spreadsheets to other formats.
Command Line Parameters:
Each Parameter should be followed by its value eg
-f "c:\Docs\MyDoc.doc"
Parameters markers are case insensitive.
-H This message
--HELP -?
-WD Use Word for Conversion (Default). Help '-h -wd'
--word
-XL Use Excel for Conversion. Help '-h -xl'
--excel
-PP Use Powerpoint for Conversion. help '-h -pp'
--powerpoint
-VS Use Visio for Conversion.
--visio
-F Input File or Directory
--inputfile
-FX Input Extension to search for if directory. (.rtf .txt etc)
Default ".doc*" (will find ".docx" also)
--inputextension
--inputfilter
Filter Files to input. Property*.doc will match Property1.doc,
Property2.doc etc
-O Output File or Directory to place converted Docs
--outputfile
-OX Output Extension if -F is Directory. Please include '.' eg. '.pdf' .
If not provided, pulled from standard list.
--outputextension
-T Format(Type) to convert file to, either integer or wdSaveFormat constant.
Available from
https://docs.microsoft.com/en-us/dotnet/api/microsoft.office.interop.word.wdsaveformat
or https://docs.microsoft.com/en-us/dotnet/api/microsoft.office.interop.excel.xlfileformat
See current List Below.
--format
-TF Force Format. -T value if an integer, is checked against current list
compiled in. It is not passed if unavailable. -TF will pass through value
without checking. Word will return an "EOleException Value out of range"
error if invalid. Use instead of -T.
--forceformat
-L Log Level Integer: 1 ERRORS 2 STANDARD 5 CHATTY 9 DEBUG 10 VERBOSE. Default: 2=STANDARD
--loglevel
-C Compatibility Mode Integer. Set to an INTEGER value from
https://msdn.microsoft.com/en-us/library/office/ff192388.aspx.
Set the compatibility mode when you want to convert documents to a later
version of word. See help '-h -c' for further info.
--compatibility
-E Encoding Integer: Sets codepage Encoding. See
https://msdn.microsoft.com/en-us/library/office/ff860880.aspx
for more details and values.
--encoding
-M Ignore all files in __MACOSX\ subdirectory if it exists. Default True.
--ignoremacos
-N Make list of files that take over n seconds to complete.
Use number of seconds over that conversion takes and add to list.
Outputs to filename 'docto.ignore.txt'
--listlongrunning
-NX Ignore any file listed in docto.ignore.txt, created by -N
--ignorelongrunninglist
-G Write Log to file in directory
--writelogfile
-GL Log File Name to Use. Default 'DocTo.Log';
--logfilename
-Q Quiet Mode: Nothing will be output to console. To see any errors you must
set -G or -GL. Equivalent to setting -L 0
--quiet
-R Remove Files after successful conversion: Default false; To use specify
value eg -R true
--deletefiles
-W Webhook: Url to call on events. See help '-H -HW' for more details.
--webhook
-X Halt on COM Error: Default True; If you have trouble with some files
not converting, set this to false to ignore errors and continue with
batch job.
--halterror
-V Show Versions. DocTo and Word/Excel/Powerpoint
Long Parameters:
--BookmarkSource
PDF conversions can take their bookmarks from
WordBookmarks, WordHeadings (default) or None
--DoNotOverwrite
--no-overwrite
Existing files are overridden by default, if you do not wish a file to be
over written use this option.
--no-subdirs Only convert specified directory. Do not recurse sub directories
--ExportMarkup Value for wdExportItem - default wdExportDocumentContent.
use wdExportDocumentWithMarkup to export all word comments with pdf
--no-IncludeDocProperties
--no-DocProp
Do not include Document Properties in the exported pdf file.
--PDF-OpenAfterExport
If you wish for a converted PDF to be opened after creation. No value req.
--PDF-FromPage
Save a range of pages to pdf. Integer/String. If integer --PDF-ToPage must also be set.
Other values wdExportCurrentPage, wdExportSelection
--PDF-ToPage
Save a range of pages to pdf. Integer. --PDF-FromPage must also be set.
--PDF-OptimizeFor
Set the pdf/xps to be optimized for print or screen.
Default ForPrint | ForOnScreen
--XPS-no-IRM
Do not copy IRM permissions to exported XPS document.
--PDF-No-DocStructureTags
Do not include DocStructureTags to help screen readers.
--PDF-no-BitmapMissingFonts
Do not bitmap missing fonts, fonts will be substituted.
--use-ISO190051
Create PDF to the ISO 19005-1 standard.
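The --PDF-FromPage and --PDF-ToPage rules in the help text above can be checked before DocTo is launched. The following is a minimal sketch under stated assumptions: the helper name `pdf_page_args` is hypothetical, and it assumes each value is passed as the argument following its parameter, as the help text describes for parameters in general. How DocTo treats a --PDF-ToPage value combined with one of the named constants is not documented here, so the sketch simply passes it through.

```python
from typing import List, Optional, Union

# Named values listed under --PDF-FromPage in the help text.
PAGE_CONSTANTS = {"wdExportCurrentPage", "wdExportSelection"}


def pdf_page_args(from_page: Union[int, str, None] = None,
                  to_page: Optional[int] = None) -> List[str]:
    """Return page-range arguments, enforcing the pairing rules from the help."""
    if from_page is None:
        if to_page is not None:
            raise ValueError("--PDF-ToPage requires --PDF-FromPage")
        return []
    if isinstance(from_page, int):
        # An integer start page must be paired with an end page.
        if to_page is None:
            raise ValueError("an integer --PDF-FromPage requires --PDF-ToPage")
    elif from_page not in PAGE_CONSTANTS:
        raise ValueError(f"unknown --PDF-FromPage value: {from_page!r}")
    args = ["--PDF-FromPage", str(from_page)]
    if to_page is not None:
        args += ["--PDF-ToPage", str(to_page)]
    return args
```

The returned list is meant to be appended to an existing docto argument list, for example one that already contains `-f`, `-O` and `-T wdFormatPDF`.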
