I've written a couple of programs to help convert various HTML sources to markdown.
I also use third-party programs for this.
# Browser Extensions
See also: [[Data Hoarding#Live#Scraping]]
- [[MarkDownload]] (doesn't handle HTML tables)
# Command Line
## html2md
[[html2md by suntong - doc converter]] is my preferred converter.
## Alternatives
- [[html2text by jaytaylor - doc converter]]
- [[reader by mrusme - doc converter and viewer]]
- [[pandoc - Document Conversion]] (very finicky; output often contains raw HTML and/or pandoc's own dialect of "markdown")
# Libraries
- [[downmark by yujiahaol68]] - https://github.com/yujiahaol68/downmark
- [[html-to-markdown by JohannesKaufmann]]
# Mine
## ffff & mdlinkrewrite
A collection of scripts I've written.
The various `f*` programs and `mdlinkrewrite` extract and rewrite data found in markdown files.
The `f*` programs set frontmatter and extract dates/times from files.
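A minimal Python sketch of the frontmatter idea, not the actual `f*` code. It assumes YAML frontmatter delimited by `---` lines, a `date` key, and that the timestamp comes from the file's modification time; the function names `set_frontmatter` and `file_mtime_iso` are hypothetical.

```python
from datetime import datetime, timezone
from pathlib import Path


def set_frontmatter(text: str, key: str, value: str) -> str:
    """Return text with `key: value` set in its leading frontmatter block."""
    lines = text.split("\n")
    entry = f"{key}: {value}"
    # An existing block starts with "---" on the first line and has a closing "---".
    if lines and lines[0] == "---" and "---" in lines[1:]:
        end = lines.index("---", 1)
        for i in range(1, end):
            if lines[i].startswith(f"{key}:"):
                lines[i] = entry  # overwrite the existing key
                break
        else:
            lines.insert(end, entry)  # append the key just before the closing "---"
        return "\n".join(lines)
    # No frontmatter yet: create a new block above the content.
    return "\n".join(["---", entry, "---", text])


def file_mtime_iso(path: Path) -> str:
    """Return the file's modification time as an ISO 8601 UTC timestamp."""
    mtime = path.stat().st_mtime
    return datetime.fromtimestamp(mtime, tz=timezone.utc).isoformat(timespec="seconds")


if __name__ == "__main__":
    note = Path("file.md")  # assumed file name
    if note.exists():
        updated = set_frontmatter(note.read_text(), "date", file_mtime_iso(note))
        note.write_text(updated)
```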
`mdlinkrewrite` converts absolute URLs into internal wikilinks. It can rewrite both HTML and markdown style links, and it is aware of markdown tables so that link names can be escaped for compatibility with [[Obsidian Editor]].
Usage:
```sh
mdlinkrewrite file.md http://url.to/remove/ prefix-to-add
```
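The rewriting can be illustrated with a rough Python sketch. This is not the actual `mdlinkrewrite` source: the regexes, the function names, and the choice to escape the wikilink separator as `\|` inside table rows are my assumptions about one way to get the behaviour described above.

```python
import re

# [label](url), but not image embeds like ![alt](url)
MD_LINK = re.compile(r"(?<!!)\[([^\]]*)\]\(([^)\s]+)\)")
# <a href="url">label</a>, with href as the first attribute
HTML_LINK = re.compile(r'<a\s+href="([^"]+)"[^>]*>(.*?)</a>', re.IGNORECASE)


def _wikilink(target: str, label: str, in_table: bool) -> str:
    """Build [[target|label]], escaping the pipe when inside a table row."""
    if not label or label == target:
        return f"[[{target}]]"
    sep = "\\|" if in_table else "|"
    return f"[[{target}{sep}{label}]]"


def rewrite_line(line: str, base_url: str, prefix: str) -> str:
    """Rewrite links on one line that point below base_url."""
    # Assumption: a line starting with "|" is a markdown table row.
    in_table = line.lstrip().startswith("|")

    def convert(url: str, label: str, original: str) -> str:
        if not url.startswith(base_url):
            return original  # leave unrelated links untouched
        return _wikilink(prefix + url[len(base_url):], label, in_table)

    line = MD_LINK.sub(lambda m: convert(m.group(2), m.group(1), m.group(0)), line)
    line = HTML_LINK.sub(lambda m: convert(m.group(1), m.group(2), m.group(0)), line)
    return line


def rewrite_links(text: str, base_url: str, prefix: str) -> str:
    """Apply rewrite_line to every line of a document."""
    return "\n".join(rewrite_line(l, base_url, prefix) for l in text.split("\n"))
```

Working line by line keeps the table check cheap: the only context needed is whether the current line looks like a table row, because an unescaped `|` there would be read as a cell boundary.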