@mtcute/markdown-parser
v0.18.0-rc.5
Published
Markdown entities parser for mtcute
Downloads
2,170
Readme
@mtcute/markdown-parser
Markdown entities parser for mtcute
NOTE: The syntax implemented here is not compatible with Bot API Markdown, nor MarkdownV2.
Please read Syntax below for a detailed explanation
Features
- Supports all entities that Telegram supports
- Supports nested and overlapping entities
- Supports dedentation
- Interpolation!
Usage
import { md } from '@mtcute/markdown-parser'
tg.sendText(
'me',
md`
Hello, **me**! Updates from the feed:
${await getUpdatesFromFeed()}
`
)
Syntax
Inline entities
Inline entities are defined by some tag surrounding some text, and processing them simply strips their tag.
Supported entities:
- Bold, tag is
**
- Italic, tag is
__
- Underline, tag is
--
(NON-STANDARD) - Strikethrough, tag is
~~
- Spoiler, tag is
||
(NON-STANDARD) - Code (monospaced font), tag is `
- Note that escaping text works differently inside code, see below.
Unlike CommonMark, we use the symbol itself and not its count. Thus, using
*
(asterisk) will always produce bold, and using_
(underscore) will always produce italic.This eliminates a lot of confusion, like:
_bold_
→ bold,**italic**
→ italic
| Code | Result (visual) | Result (as HTML) |
|----------------------------------------------|-------------------|------------------------------|
| **bold**
| bold | <b>bold</b>
|
| __italic__
| italic | <i>italic</i>
|
| --underline--
| underline | <u>underline</u>
|
| ~~strikethrough~~
| ~~strikethrough~~ | <s>strikethrough</s>
|
| ||spoiler|| | N/A | <spoiler>spoiler</spoiler>
|
| *whatever*
| *whatever* | *whatever*
|
| _whatever_
| _whatever_ | _whatever_
|
| `hello world` | hello world
| <code>hello world</code>
|
| `text` | __text__
| <code>__text__</code>
|
Pre
Pre represents a single block of code, optionally with a language.
This entity starts with ``` (triple backtick), optionally followed with language name and a must be followed with a line break, and ends with ``` (triple backtick), optionally preceded with a line break.
| Code | Result (visual) | Result (as HTML) |
|--------------------------------------------------------------------|---------------------------|---------------------------------------------------------------------------------------------|
| ```hello``` | hello
| <pre>hello</pre>
|
| ```hello``` | hello
| <pre>hello</pre>
|
| ```javascriptconst a = <br>\`\`\`</code></pre> | <code>const a =
| <pre language="javascript"> const a = ``</pre> |
Links
Links are parsed exactly the same as standard markdown (except references are not supported).
Defined like this: [Link text](https://example.com)
.
- Link text may also contain any formatting, but link cannot contain other links inside (obviously).
[
(opening square bracket) inside link text will be treated like a normal character.
A markdown-style link can also be used to define a name mention like this: [Name](tg://user?id=1234567)
,
where 1234567
is the ID of the user you want to mention.
Additionally, a markdown-style link can be used to define a custom emoji like this:
[😄](tg://emoji?id=123456)
, where 123456
is ID of the emoji.
Note: It is up to the client to look up user's input entity by ID. In most cases, you can only use IDs of users that were seen by the client while using given storage.
Alternatively, you can explicitly provide access hash like this:
[Name](tg://user?id=1234567&hash=abc
, whereabc
is user's access hash written as a base-16 unsigned integer. Order of the parameters does matter, i.e.tg://user?hash=abc&id=1234567
will not be processed as expected.
| Code | Result (visual) | Result (as HTML) |
|------------------------------------|--------------------------------|--------------------------------------------------|
| [Google](https://google.com)
| Google | <a href="https://google.com">Google</a>
|
| [__Google__](https://google.com)
| Google | <a href="https://google.com"><i>Google</i></a>
|
| [empty link]()
| empty link | empty link
|
| [empty link]
| [empty link] | [empty link]
|
| [User](tg://user?id=1234567)
| N/A | N/A |
| [😄](tg://emoji?id=123456)
| N/A | N/A |
Nested and overlapping entities
Quite a powerful feature of this parser is the ability to process overlapping entities. Only inline entities (except code) can be overlapped.
Since inline entities are only defined by their tag, and nesting same entities doesn't make sense, you can think of the tags just as start/end markers, and not in terms of nesting.
| Code | Result (visual) | Result (as HTML) |
|-------------------------------|---------------------------|----------------------------------------|
| **Welcome back, __User__!**
| Welcome back, User! | <b>Welcome back, <i>User</i>!</b>
|
| **bold __and** italic__
| bold and italic | <b>bold <i>and</i></b><i> italic</i>
|
Interpolation
Being a tagged template literal, md
supports interpolation.
You can interpolate one of the following:
string
- will not be parsed, and appended to plain text as-is- In case you want the string to be parsed, use
md
as a simple function: md`... ${md('bold')} ...`
- In case you want the string to be parsed, use
number
- will be converted to string and appended to plain text as-isTextWithEntities
orMessageEntity
- will add the text and its entities to the output. This is the type returned bymd
itself:const bold = md`**bold**` const text = md`Hello, ${bold}!`
- falsy value (i.e.
null
,undefined
,false
) - will be ignored
Because of interpolation, you almost never need to think about escaping anything, since the values are not even parsed as Markdown, and are appended to the output as-is.