misc/homepage.erb in rmmseg-cpp-0.2.7 vs misc/homepage.erb in rmmseg-cpp-0.2.9

- old
+ new

@@ -1,9 +1,9 @@ <%# -*- mode: text; coding: utf-8 -*- %> <% $title = "rmmseg-cpp Homepage" - $authors = { 'pluskid' => 'http://pluskid.lifegoo.com' } + $authors = { 'pluskid' => 'http://blog.pluskid.org' } %> <% chapter "Introduction" do %> rmmseg-cpp is a high performance Chinese word segmentation utility for @@ -108,12 +108,29 @@ <code> RMMSeg::Dictionary.load_dictionaries </code> - Now rmmseg-cpp will be ready to do segmenting. + Now rmmseg-cpp will be ready to do segmenting. If you want to load your own customized + dictionaries, please customize <tt>RMMSeg::Dictionary.dictionaries</tt> before calling + <tt>load_dictionaries</tt>. e.g. + <code> + RMMSeg::Dictionary.dictionaries = [[:chars, "my_chars.dic"], + [:words, "my_words.dic"], + [:words, "my_words2.dic"]] + </code> + + The basic format for char-dictionary and word-dictionary are similar. For each line, + there is a number, then *a* space, then the string. Note there *SHOULD* be a newline + at the end of the dictionary file. And the number in char-dictionary and word-dictionary + has different meaning. + + In char-dictionary, the number means the frequency of the character. In word-dictionary, + the number mean the number of characters in the word. Note that this is NOT the number + of *bytes* in the word. + <% end %> <% section "Ferret Integration" do %> To use rmmseg-cpp with Ferret, you'll need to @require@ the @@ -137,11 +154,11 @@ A complete example can be found in <tt>misc/ferret_example.rb</tt>. The result of running that example is shown in <%= xref "Ferret Example Screenshot" %>. <% figure "Ferret Example Screenshot" do %> - !http://pluskid.lifegoo.com/wp-content/uploads/2008/02/rmmseg.png! + !http://lifegoo.pluskid.org/wp-content/uploads/2008/02/rmmseg.png! <% end %> <% end %> <% section "Normal Ruby program" do %> @@ -172,8 +189,8 @@ <% end %> <% chapter "Resources" do %> * "Project Home":http://rubyforge.org/projects/rmmseg-cpp/: The Project page at RubyForge. * "RDoc of rmmseg-cpp":http://rmmseg-cpp.rubyforge.org/rdoc/index.html: The auto generated rdoc of RMMSeg. - * "Free Mind":http://pluskid.lifegoo.com/: The author's blog. + * "Free Mind":http://blog.pluskid.org/: The author's blog. * "Author's Email":mailto:pluskid@gmail.com: Contact me if you have any problem. <% end %>