README.md in postrank-uri-1.0.17 vs README.md in postrank-uri-1.0.18

- old
+ new

@@ -12,11 +12,11 @@ - ... and the list goes on - check the specs. ## API - **PostRank::URI.extract(text)** - Detect URIs in text, discard bad TLD's -- **PostRank::URI.clean(uri)** - Unescape, normalize, apply c18n filters - 95% use case. +- **PostRank::URI.clean(uri)** - Unescape, normalize, apply c14n filters - 95% use case. - **PostRank::URI.normalize(uri)** - Apply RFC normalization rules, discard extra path characters, drop anchors - **PostRank::URI.unescape(uri)** - Unescape URI entities, handle +/%20's, etc - **PostRank::URI.escape(uri)** - Escape URI @@ -31,10 +31,10 @@ >> PostRank::URI.clean('link.to?a=b&utm_source=FeedBurner#stuff') [ [0] "http://link.to/?a=b" ] -## C18N +## C14N -As part of URI canonicalization the library will remove common tracking parameters from Google Analytics and several other providers. Beyond that, host-specific rules are also applied. For example, nytimes.com likes to add a 'partner' query parameter for tracking purposes, but which has no effect on the content - hence, it is removed from the URI. For full list, see the c18n.yml file. +As part of URI canonicalization the library will remove common tracking parameters from Google Analytics and several other providers. Beyond that, host-specific rules are also applied. For example, nytimes.com likes to add a 'partner' query parameter for tracking purposes, but which has no effect on the content - hence, it is removed from the URI. For full list, see the c14n.yml file. Detecting "duplicate URLs" is a hard problem to solve (expensive in all senses), instead we are compiling a manually assembled database. If you find cases which are missing, please do report them, or send us a pull request! \ No newline at end of file