Maruku: a Markdown interpreter

Maruku is a Markdown interpreter written in Ruby.

Maruku allows you to write in an easy-to-read-and-write syntax, like this:

This document in Markdown

Then it can be translated to HTML:

This document in HTML

or Latex, which is then converted to PDF:

This document in PDF

Maruku implements the original Markdown syntax (HTML or PDF, translated by Maruku).

Markdown implements also all the improvements in PHP Markdown Extra.

Moreover, it implements ideas from MultiMarkdown.

Authors

Maruku has been developed so far by Andrea Censi. Contributors are most welcome!


Table of contents:


Download

The development site is http://rubyforge.org/projects/maruku/.

Download current gem at http://rubyforge.org/frs/?group_id=2795 or try to install with:

$ gem install maruku

Anonymous access to the repository is possible with:

$ svn checkout svn://rubyforge.org/var/svn/maruku

If you want commit-access, just create an account on Rubyforge and drop me a mail.

Usage

This is the basic usage:

require 'maruku'

doc = Maruku.new(markdown_string)
puts doc.to_html

or, if you install through RubyGems,

require 'rubygems'
require 'maruku'

This outputs a complete XHTML 1.0 document:

puts doc.to_html_document

You can have the REXML document tree with:

tree = doc.to_html_document_tree

From the command line

There are two command-line programs installed: maruku and marutex

maruku converts Markdown in HTML:

$ maruku file.md  # creates file.html

marutex converts Markdown in TeX, then calls pdflatex to transform to PDF:

$ marutex file.md  # creates file.tex and file.pdf

Examples of PHP Markdown Extra syntax

<div markdown="1" style="border: solid 1px black">
   This is a div with Markdown **strong text**
</div>

This is a div with Markdown strong text

Maruku and Bluecloth

The other Ruby implementation of Markdown is Bluecloth.

Maruku is much different in philosophy from Bluecloth: the biggest difference is that parsing is separated from rendering. In Maruku, an in-memory representation of the Markdown document is created. Instead, Bluecloth mantains the document in memory as a String at all times, and does a series of gsub to transform to HTML.

The in-memory representation makes it very easy to export to various formats (altough, for, now)

Other improvements over Bluecloth:

New meta-data syntax

Maruku implements a syntax that allows to attach "meta" information to objects.

Meta-data for the document

Meta-data for the document itself is specified through the use of email headers:

Title: A simple document containing meta-headers
CSS: style.css

Content of the document

When creating the document through

Maruku.new(s).to_html_document

the title and stylesheet are added as expected.

Meta-data for elements

Maruku introduces a new syntax for attaching metadata to paragraphs, tables, and so on.

For example, consider the creation of two paragraphs:

Paragraph 1 is a warning.

Paragraph 2

Now you really want to attach a 'class' attribute to the paragraphs (for example for CSS styling). Maruku allows you to use:

@ class: warning
Paragraph 1 is a warning

Paragraph 2

You can add more by separating with a ;:

@ class: warning; id: warning1
Paragraph 1 is a warning

A meta-data declaration is composed of

  1. newline
  2. an at-symbol '@'
  3. a series of name-value pairs. Each name-value is separated by a colon :, pairs are separated by semi-colons ;

Many declaration can be used, and they refer to the following object:

@ class: warning
@ id: warning1
Paragraph 1 is a warning

These can also be separated by newlines:

@ class: warning

@ id: warning1

Paragraph 1 is a warning

Also, if the value is not present, it defaults to true:

@ test

This paragraph has the attribute 'test' set.

List of meta-data

title, subject

(document) Sets the title of the document (HTML: used in the TITLE element).

css

(document, HTML) Url of stylesheet.

html_use_syntax

(document, HTML) Use the syntax library to add source highlighting.

latex_use_listings

(document, LaTex) Use fancy listing package for better displaying code blocks.

style, id, class

(any block object, HTML) Standard CSS attributes are copied.

lang

(code blocks) Name of programming language (ruby) for syntax highlighting (does not work yet)

Default for this is code_lang in document.

code_show_spaces

Shows tabs and newlines (default is read in the document object).

code_background_color

Background color for code blocks. (default is read in the document object).

The format is either a named color (green, red) or a CSS color of the form #ff00ff.

Examples

An example of this is the following:

@¬code_show_spaces;¬code_background_color:¬green

»   ¬One¬space
»   ¬¬Two¬spaces
»   »   ¬»   Tab,¬space,¬tab
»   »   »   »   Tab,¬tab,¬tab¬and¬all¬is¬green!

That will produce:

¬One¬space
¬¬Two¬spaces
»   ¬»   Tab,¬space,¬tab
»   »   »   Tab,¬tab,¬tab¬and¬all¬is¬green!

Example with css-style color:

@ code_background_color: #455678

	A strange color

produces:

A strange color

Or highlighting (does not work well yet):

@ lang: xml
	<div style="text-align:center">Div</div>

produces:

<div style="text-align:center">Div</div>

Future developments

I think that Pandoc and MultiMarkdown are very cool projects. However, they are written in Haskell and Perl, respectively. I would love to have an equivalent in Ruby.

Syntax improvements

Things I'm thinking about:


  1. I really was missing those.