Parse Wiki Text released

Fredrik · November 2, 2018, 8:30am

Parse Wiki Text parses wiki text from MediaWiki into a tree of elements. This is extremely challenging to do correctly, because wiki text was never designed to be parsed, only to be transformed through a step-by-step procedure into a fixed HTML format that is tied to the way websites were designed in the past. See the readme file for more information and some interesting reading about the challenges of parsing wiki text. You can find Parse Wiki Text on crates.io and GitHub.
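To give a rough idea of what "a tree of elements" means, here is a toy sketch that only recognizes `[[target]]` and `[[target|text]]` links. The `Node` type and the `parse` function below are hypothetical names made up for this illustration; they are not the Parse Wiki Text API, and real wiki text is far harder to handle than this.

```rust
// Toy illustration only: `Node` and `parse` are hypothetical and are
// NOT the API of the Parse Wiki Text crate.
#[derive(Debug, PartialEq)]
enum Node {
    /// A run of plain text.
    Text(String),
    /// An internal link such as `[[target|text]]`.
    Link { target: String, text: String },
}

/// Splits `input` into plain text and `[[...]]` links.
/// An unterminated `[[` is left in the output as plain text.
fn parse(input: &str) -> Vec<Node> {
    let mut nodes = Vec::new();
    let mut rest = input;
    while let Some(start) = rest.find("[[") {
        let after_open = &rest[start + 2..];
        // Stop scanning if the link is never closed.
        let end = match after_open.find("]]") {
            Some(end) => end,
            None => break,
        };
        // Emit any text that precedes the link.
        if start > 0 {
            nodes.push(Node::Text(rest[..start].to_string()));
        }
        let inner = &after_open[..end];
        // `[[target]]` uses the target itself as the displayed text.
        let (target, text) = match inner.split_once('|') {
            Some((target, text)) => (target, text),
            None => (inner, inner),
        };
        nodes.push(Node::Link {
            target: target.to_string(),
            text: text.to_string(),
        });
        rest = &after_open[end + 2..];
    }
    // Whatever remains (including an unterminated link) is plain text.
    if !rest.is_empty() {
        nodes.push(Node::Text(rest.to_string()));
    }
    nodes
}
```

Calling `parse("See [[Rust]].")` gives a `Text` node, a `Link` node whose target and text are both `Rust`, and a final `Text` node. Even this small subset needs a decision about unterminated links, which hints at why a naive approach breaks down on real pages.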

There are several reasons I'm announcing this on the forum:

It's the first crate I've published. I would like feedback.
I would like to make Parse Wiki Text better known. There are probably many people who would benefit from processing wiki text.
This may be interesting reading for anyone interested in parsers. See the readme file for a description of the challenges in parsing wiki text.
I would like to apply to be crate of the week. A Wikipedia browser has been crate of the week before, and it uses naive regular expressions to parse wiki text, so I think a high-quality wiki text parser deserves to be the next crate of the week.
I would like to invite a discussion. Ask me anything. It can be about wiki text, making parsers in Rust or my personal experience with Rust.

Happy hacking!

leanhdung1994 · May 3, 2021, 12:22am

Hi @Fredrik, I'm very interested in parsing wikitext. Unfortunately, your website is down. Could I have your email address so I can contact you?

