Word to PDF converts .docx files into PDF in the page itself, never on a server, with no server involvement. The tool offers two conversion engines you choose between with the toggle above — Direct Parse and HTML Render — each with different trade-offs. Both keep the file in tab memory the whole time — nothing is uploaded.
The Direct Parse engine reads the .docx ZIP archive (DOCX is XML-in-a-ZIP), parses the OOXML structure with a custom parser, runs deterministic layout, and writes PDF primitives via pdf-lib. This route preserves text as text (selectable, searchable, copy-paste-able) and produces a small, vector-clean PDF, roughly 50–500 KB for a typical document. Fonts are mapped to the standard PDF font families — Calibri and Arial render as Helvetica, and Times New Roman is preserved — so the shapes are close but not pixel-identical to the original typeface. It is the better choice when the document uses standard paragraphs, headings, lists, simple tables, and inline images. Direct Parse deliberately rejects documents that use features it cannot reproduce faithfully (such as hyperlinks, complex field codes, equations, charts, SmartArt, or embedded fonts) with a clear message rather than producing a wrong-looking PDF — switch to HTML Render for those.
The HTML Render engine uses mammoth (DOCX → semantic HTML), renders the HTML inside an isolated iframe at 816×1056 pixels (US Letter at 96 DPI), captures each page with html2canvas, and embeds the resulting PNG images into a PDF using pdf-lib. Choose this engine for documents Direct Parse rejects, or when you want a closer visual match to how the document looks on screen. The output is a raster PDF — every page is an image, so text is no longer selectable and file size is significantly larger.
The conversion tries to preserve heading levels, paragraph styles, bold/italic/underline runs, ordered and unordered lists, simple tables, and inline images (PNG/JPEG embedded as PDF XObjects). It also detects title-like centered content via heuristics and applies the appropriate alignment. Note that hyperlinks are not preserved as clickable links — Direct Parse rejects documents containing them, and HTML Render flattens them into the page image.
Realistic limits: the .doc legacy binary format (Word 97–2003) is not supported — only .docx. Documents with complex floating shapes, equation editor (OMath), embedded charts, footnotes that span pages, multi-column layouts, headers/footers with field codes (PAGE OF NUMPAGES), and SmartArt are rejected by Direct Parse and may render imperfectly under HTML Render. For those documents, the most reliable conversion is still Microsoft Word's built-in Save As PDF or LibreOffice in headless mode.
Image fidelity in the direct pipeline matches the source — embedded images are pulled from the .docx ZIP and re-embedded into the PDF without re-encoding, so quality is preserved. The HTML pipeline rasterizes everything at 96 DPI, which is fine for screen reading but soft when zoomed or printed at full Letter size.
Browser memory is the practical limit. .docx files up to ~30 MB and 200 pages convert comfortably. Documents with hundreds of high-resolution embedded images can pressure the tab's heap during the html2canvas pass; in those cases, the direct pipeline is dramatically lighter and usually succeeds where the raster pipeline fails.