html-document-parser
v1.1.1
HTML markup parser in a web browser and in Node.js
Parses an arbitrary HTML markup snippet into DOM Elements in a web browser or in Node.js.
API
Parse HTML document
import parseHtmlDocument from 'html-document-parser'
const html = '<html><head></head><body><div></div></body></html>'
const document = parseHtmlDocument(html)
document.childNodes.length === 1
document.childNodes[0].tagName === 'HTML'
- Input: HTML document markup.
- Output: DOM Document.
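Once a document has been parsed, a common next step is to look up a particular element in it. The following is a minimal sketch of that, relying only on the standard DOM properties `childNodes`, `nodeType` and `tagName`. The helper name `findFirstByTagName` is hypothetical and is not part of this package.

```javascript
// Hypothetical helper (not part of html-document-parser):
// depth-first search for the first element with the given tag name.
function findFirstByTagName(node, tagName) {
  const wanted = tagName.toUpperCase()
  // `childNodes` is array-like, so use an index loop rather than array methods.
  const children = node.childNodes || []
  for (let i = 0; i < children.length; i++) {
    const child = children[i]
    // `node.nodeType === 1` means "DOM element".
    if (child.nodeType === 1) {
      // Compare case-insensitively, in case a parser does not upper-case tag names.
      if (String(child.tagName).toUpperCase() === wanted) {
        return child
      }
      const found = findFirstByTagName(child, tagName)
      if (found) {
        return found
      }
    }
  }
  // No matching element anywhere under `node`.
  return null
}
```

With the example above, `findFirstByTagName(document, 'body')` would be expected to return the `<body>` element, and `null` is returned when no element matches.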
Parse HTML snippet
import parseHtmlSnippet from 'html-document-parser/snippet'
const html = 'Some <b>bold</b><span> text</span>'
const root = parseHtmlSnippet(html)
root.childNodes.length === 3
// `node.nodeType === 3` means "text node".
root.childNodes[0].nodeType === 3
root.childNodes[0].nodeValue === 'Some '
// `node.nodeType === 1` means "DOM element".
root.childNodes[1].nodeType === 1
root.childNodes[1].tagName === 'B'
root.childNodes[2].nodeType === 1
root.childNodes[2].tagName === 'SPAN'
- Input: Arbitrary HTML markup snippet.
- Output: A root node whose `childNodes` are the parsed DOM nodes (elements and text nodes).
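Because the parsed snippet mixes text nodes and elements, reading its plain text means walking the tree. The sketch below shows one way to do that using only `nodeType`, `nodeValue` and `childNodes`. The helper name `collectText` is hypothetical and is not provided by this package.

```javascript
// Hypothetical helper (not part of html-document-parser):
// concatenates the values of all text nodes under `node`, in document order.
function collectText(node) {
  // `node.nodeType === 3` means "text node".
  if (node.nodeType === 3) {
    return node.nodeValue
  }
  let text = ''
  // `childNodes` is array-like, so use an index loop rather than array methods.
  const children = node.childNodes || []
  for (let i = 0; i < children.length; i++) {
    text += collectText(children[i])
  }
  return text
}
```

For the snippet `'Some <b>bold</b><span> text</span>'` parsed as shown above, `collectText(root)` would be expected to yield `'Some bold text'`.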
Under the hood
Under the hood, it uses the native DOMParser in web browsers and the xmldom polyfill in Node.js.
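The environment switch described above can be sketched roughly as follows. This is an illustration only, not the package's actual source: the names `createParser`, `parseDocument` and `loadPolyfill` are hypothetical, and the polyfill is passed in as a function so that the sketch has no dependencies of its own.

```javascript
// Illustrative sketch only (not the package's actual source).
// `loadPolyfill` is a hypothetical callback that returns a module
// exporting a `DOMParser` constructor, such as xmldom.
function createParser(loadPolyfill) {
  // Web browsers expose a global `DOMParser` constructor.
  if (typeof DOMParser !== 'undefined') {
    return new DOMParser()
  }
  // Elsewhere (for example in Node.js), fall back to the polyfill.
  const polyfill = loadPolyfill()
  return new polyfill.DOMParser()
}

// Parses HTML markup with whichever parser is available.
function parseDocument(html, loadPolyfill) {
  return createParser(loadPolyfill).parseFromString(html, 'text/html')
}
```

In Node.js, this might be called as `parseDocument(html, () => require('xmldom'))`, assuming xmldom is installed. In a web browser the callback is never invoked.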
GitHub
On March 9th, 2020, GitHub, Inc. silently banned my account (erasing all my repos, issues and comments) without any notice or explanation. Because of that, all source code had to be promptly moved to GitLab. The GitHub repo is now used only as a backup (you can star the repo there too), and the primary repo is now the GitLab one. Issues can be reported in either repo.