# CrazyHarry
[![Build Status](https://secure.travis-ci.org/lonelyplanet/crazy_harry.png)](http://travis-ci.org/lonelyplanet/crazy_harry)
CrazyHarry is a high-level html fragment sanitiser/cleaner in use at
[Lonely Planet](http://www.lonelyplanet.com). It is based on [Flavour
Jones's](http://mike.daless.io) [Loofah Gem](https://github.com/flavorjones).
[Loofah](https://github.com/flavorjones) is a great tool and we've been
using it in a number of different projects. Hoewever, we found that we
were repeating the same types of cleaning job in multiple places.
CrazyHarry wraps up a number these tasks in a simple DSL, while adding commands for a few edge cases that are not straightforward with Loofah.
## Installation
gem 'crazy_harry'
bundle install
## Usage
object_with_description.each do |obj|
if descriptions[l.external_id]
sanitised_fragment = CrazyHarry.fragment(descriptions[obj.external_id])
.redact!( unsafe: true, tags: 'img')
.change!( from: 'b', to: 'h3' )
.change!( from: 'strong', to: 'h3' )
obj.update_column(:description, sanitised_fragment.to_s)
end
end
## Default Actions
It automatically removes blank tags, converts `
` tags to wrapped
paragraphs and de-dupes content.
## Chaining
As per the previous example, all calls **except** `.strip!` (which removes
all markup) may be chained. (`.strip!` can be the last element in a
chain. See below).
## Scoping and Targeting by Content
All commands except `.strip!` accept `scope:` and `text:` attributes:
CrazyHarry.fragment( '
Hotels
Tents' ).change!( from: 'b', to: 'em', scope: 'p' ).to_s will produce:Hotels
Tents while: CrazyHarry.fragment( 'Hot hotels in Saigon' ).change!( from: 'b', to: 'em', text: 'hotels' ).to_s will produce: Hot hotels in Saigon ## Adding Attributes Use the `.translate` command to do this: harry.fragment( 'HeaderContent
' ).translate!( add_attributes: { class: 'partner'} ).to_s will return: HeaderContent
If a tag already has an attribute, the new attribute will be appended to the existing one: HeaderContent
## Stripping ### Specific Tags Use the `.redact!` command. It **does not** strip unsafe tags by default. To do this, pass the `unsafe: true` option. ### All Tags Use the `.strip!` command. It can be used as the last tag in a chain (with `.translate( from_text:Some Content
` and `Some Content
` will not be treated as duplicates. * Be able to turn off default actions. * It should be able to work on documents as well as fragments. * Merge `.translate!` with `.change!` * Foster other orphaned tags besides just li. ## Contributing 1. Fork it 2. Create your feature branch (`git checkout -b my-new-feature`) 3. Commit your changes (`git commit -am 'Added some feature'`) 4. Push to the branch (`git push origin my-new-feature`) 5. Create new Pull Request