Markitdown

Latest version: v0.1.1

Safety actively analyzes 723177 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 2 of 2

0.0.2

What's Changed
* Avoids resetting warning filters (addresses 1068) by afourney in https://github.com/microsoft/markitdown/pull/1101
* Removes deprecated features from 0.0.1aX (pre-release alphas) by afourney in https://github.com/microsoft/markitdown/pull/1105

**Full Changelog**: https://github.com/microsoft/markitdown/compare/v0.0.1...v0.0.2

0.0.1

Promoting v0.0.1a5 to a full release.

For more details see the prior [Release Notes](https://github.com/microsoft/markitdown/releases/tag/v0.0.1a5).

0.0.1a5

What's Changed
* Fixed compatibility with [markdownify v1.0.0](https://github.com/matthewwithanm/python-markdownify/releases/tag/1.0.0)
New Contributors
* lh0x00 made their first contribution in https://github.com/microsoft/markitdown/pull/1072

**Full Changelog**: https://github.com/microsoft/markitdown/compare/v0.0.1a4...v0.0.1a5

0.0.1a4

Some of What's Changed
* feat: Add RSSConverter by Soulter in https://github.com/microsoft/markitdown/pull/97
* feat: Add IpynbConverter by AumGupta in https://github.com/microsoft/markitdown/pull/71
* feat(devcontainer): Add DevContainer Configuration for Easier Contribution Setup by l-lumin in https://github.com/microsoft/markitdown/pull/64
* feat: add support for conversion via Document Intelligence by KennyZhang1 in https://github.com/microsoft/markitdown/pull/303
* feat: add version option to markitdown CLI by l-lumin in https://github.com/microsoft/markitdown/pull/172
* feat: enable Git support in devcontainer by numekudi in https://github.com/microsoft/markitdown/pull/136
* feat: outlook ".msg" file converter by muratcankurtulus in https://github.com/microsoft/markitdown/pull/196
* feat: Add xls support by yeungadrian in https://github.com/microsoft/markitdown/pull/169
* feat: support image description with LLM for pptx files by masquare in https://github.com/microsoft/markitdown/pull/306
* fix: Safeguard against path traversal for ZipConverter by finchy in https://github.com/microsoft/markitdown/pull/129
* fix: support -o param to avoid encoding issues by Soulter in https://github.com/microsoft/markitdown/pull/116
* fix(transcription): TRANSCRIPTION_CAPABLE should be iniztialized by absadiki in https://github.com/microsoft/markitdown/pull/194
* fix: added a test for leading spaces. by afourney in https://github.com/microsoft/markitdown/pull/258
* fix: If puremagic has no guesses, try again after ltrim. by afourney in https://github.com/microsoft/markitdown/pull/260
* fix: Recognize json as plain text (if no other handlers are present). by afourney in https://github.com/microsoft/markitdown/pull/261
* fix: Set exiftool path explicitly. by afourney in https://github.com/microsoft/markitdown/pull/267
* fix: remove leading and trailing \n for HtmlConverter by ZeyuTeng96 in https://github.com/microsoft/markitdown/pull/262
* fix: argparse CLI option ordering, fixes 268 by slhck in https://github.com/microsoft/markitdown/pull/290
* fix: for mimetype issue with csv files on windows. by wunde005 in https://github.com/microsoft/markitdown/pull/273
* docs: update README.md by eltociear in https://github.com/microsoft/markitdown/pull/182
* docs: Add documentation for docintel by KennyZhang1 in https://github.com/microsoft/markitdown/pull/312

New Contributors
* AumGupta made their first contribution in https://github.com/microsoft/markitdown/pull/71
* diya155 made their first contribution in https://github.com/microsoft/markitdown/pull/80
* l-lumin made their first contribution in https://github.com/microsoft/markitdown/pull/64
* waterimp made their first contribution in https://github.com/microsoft/markitdown/pull/98
* finchy made their first contribution in https://github.com/microsoft/markitdown/pull/129
* sugatoray made their first contribution in https://github.com/microsoft/markitdown/pull/130
* PetrAPConsulting made their first contribution in https://github.com/microsoft/markitdown/pull/91
* SigireddyBalasai made their first contribution in https://github.com/microsoft/markitdown/pull/93
* dependabot made their first contribution in https://github.com/microsoft/markitdown/pull/177
* numekudi made their first contribution in https://github.com/microsoft/markitdown/pull/136
* eltociear made their first contribution in https://github.com/microsoft/markitdown/pull/182
* absadiki made their first contribution in https://github.com/microsoft/markitdown/pull/194
* muratcankurtulus made their first contribution in https://github.com/microsoft/markitdown/pull/196
* yeungadrian made their first contribution in https://github.com/microsoft/markitdown/pull/169
* KennyZhang1 made their first contribution in https://github.com/microsoft/markitdown/pull/303
* ZeyuTeng96 made their first contribution in https://github.com/microsoft/markitdown/pull/262
* jamesmh made their first contribution in https://github.com/microsoft/markitdown/pull/270
* masquare made their first contribution in https://github.com/microsoft/markitdown/pull/306
* slhck made their first contribution in https://github.com/microsoft/markitdown/pull/290
* wunde005 made their first contribution in https://github.com/microsoft/markitdown/pull/273

**Full Changelog**: https://github.com/microsoft/markitdown/compare/v0.0.1a3...v0.0.1a4

0.0.1a3

Not secure
New Features and Formats

* Add zip handling by Josh-XT in https://github.com/microsoft/markitdown/pull/22
* Add PPTX chart support by nyosegawa in https://github.com/microsoft/markitdown/pull/33

Breaking Changes
Renamed `mlm_client ` and `mlm_model` arguments to `llm_client` and `llm_model`, and added appropriate deprecation warnings.

See:
* Fix LLM terminology in code by CharlesCNorton in https://github.com/microsoft/markitdown/pull/73
* Fix LLM terms by CharlesCNorton in https://github.com/microsoft/markitdown/pull/72
* Added deprecation warnings for mlm_* arguments. by afourney in https://github.com/microsoft/markitdown/pull/101

Bug fixes and enhancements
* Remove invalid classifiers by simonw in https://github.com/microsoft/markitdown/pull/10
* Add installation instructions from haesleinhuepf:patch-1 by gagb in https://github.com/microsoft/markitdown/pull/27
* Update README.md by gagb in https://github.com/microsoft/markitdown/pull/28
* Improve the readme with contributing guidelines by gagb in https://github.com/microsoft/markitdown/pull/7
* Add installation instructions by haesleinhuepf in https://github.com/microsoft/markitdown/pull/24
* Update README.md by pawarbi in https://github.com/microsoft/markitdown/pull/26
* Update README.md by gagb in https://github.com/microsoft/markitdown/pull/29
* CLI usage instructions by simonw in https://github.com/microsoft/markitdown/pull/11
* Fix character decoding issues with text-like files by brc-dd in https://github.com/microsoft/markitdown/pull/19
* Catching pydub's warning of ffmpeg or avconv missing by SH4DOW4RE in https://github.com/microsoft/markitdown/pull/39
* Exclude test files from language statistics using linguist-vendored by Y-Kim-64 in https://github.com/microsoft/markitdown/pull/44
* Support specifying YouTube transcript language by narumiruna in https://github.com/microsoft/markitdown/pull/50
* Add passing style_map kwarg to Mammoth when converting docx to allow keeping comments by VillePuuska in https://github.com/microsoft/markitdown/pull/38
* Fix: pass the kwargs to _convert method when converting an url file by Soulter in https://github.com/microsoft/markitdown/pull/48
* Added Dockerfile by madduci in https://github.com/microsoft/markitdown/pull/60
* fix issue 65 by DIMAX99 in https://github.com/microsoft/markitdown/pull/67
* Cybernobie/main by gagb in https://github.com/microsoft/markitdown/pull/75
* Ensure hatch is installed before running tests by cybernobie in https://github.com/microsoft/markitdown/pull/63
* Kevinclb/main by gagb in https://github.com/microsoft/markitdown/pull/77
* feature: add argument parsing for cli tool capability by kevinclb in https://github.com/microsoft/markitdown/pull/46
* Added llm tests to the local test set. by afourney in https://github.com/microsoft/markitdown/pull/100

New Contributors
* simonw made their first contribution in https://github.com/microsoft/markitdown/pull/10
* gagb made their first contribution in https://github.com/microsoft/markitdown/pull/27
* haesleinhuepf made their first contribution in https://github.com/microsoft/markitdown/pull/24
* pawarbi made their first contribution in https://github.com/microsoft/markitdown/pull/26
* brc-dd made their first contribution in https://github.com/microsoft/markitdown/pull/19
* Josh-XT made their first contribution in https://github.com/microsoft/markitdown/pull/22
* nyosegawa made their first contribution in https://github.com/microsoft/markitdown/pull/33
* VillePuuska made their first contribution in https://github.com/microsoft/markitdown/pull/38
* SH4DOW4RE made their first contribution in https://github.com/microsoft/markitdown/pull/39
* Y-Kim-64 made their first contribution in https://github.com/microsoft/markitdown/pull/44
* Soulter made their first contribution in https://github.com/microsoft/markitdown/pull/48
* narumiruna made their first contribution in https://github.com/microsoft/markitdown/pull/50
* madduci made their first contribution in https://github.com/microsoft/markitdown/pull/60
* CharlesCNorton made their first contribution in https://github.com/microsoft/markitdown/pull/73
* DIMAX99 made their first contribution in https://github.com/microsoft/markitdown/pull/67
* cybernobie made their first contribution in https://github.com/microsoft/markitdown/pull/63
* kevinclb made their first contribution in https://github.com/microsoft/markitdown/pull/46

**Full Changelog**: https://github.com/microsoft/markitdown/compare/v0.0.1a2...v0.0.1a3

0.0.1a2

Not secure
Initial Release of markitdown

The MarkItDown library is a utility tool for converting various files to Markdown (e.g., for indexing, text analysis, etc.)

It presently supports:

* PDF (.pdf)
* PowerPoint (.pptx)
* Word (.docx)
* Excel (.xlsx)
* Images (EXIF metadata, and OCR)
* Audio (EXIF metadata, and speech transcription)
* HTML (special handling of Wikipedia, etc.)
* Various other text-based formats (csv, json, xml, etc.)

The API is simple:

python
from markitdown import MarkItDown

markitdown = MarkItDown()
result = markitdown.convert("test.xlsx")
print(result.text_content)

Page 2 of 2

© 2025 Safety CLI Cybersecurity Inc. All Rights Reserved.