README.md in eps-0.3.6 vs README.md in eps-0.3.7

- old
+ new

@@ -5,11 +5,11 @@ - Build predictive models quickly and easily - Serve models built in Ruby, Python, R, and more Check out [this post](https://ankane.org/rails-meet-data-science) for more info on machine learning with Rails -[![Build Status](https://travis-ci.org/ankane/eps.svg?branch=master)](https://travis-ci.org/ankane/eps) +[![Build Status](https://github.com/ankane/eps/workflows/build/badge.svg?branch=master)](https://github.com/ankane/eps/actions) ## Installation Add this line to your application’s Gemfile: @@ -132,11 +132,11 @@ ```ruby {description: "a beautiful house on top of a hill"} ``` -This creates features based on word count (term frequency). +This creates features based on [word count](https://en.wikipedia.org/wiki/Bag-of-words_model). You can specify text features explicitly with: ```ruby Eps::Model.new(data, target: :price, text_features: [:description]) @@ -145,16 +145,16 @@ You can set advanced options with: ```ruby text_features: { description: { - min_occurences: 5, - max_features: 1000, - min_length: 1, - case_sensitive: true, - tokenizer: /\s+/, - stop_words: ["and", "the"] + min_occurences: 5, # min times a word must appear to be included in the model + max_features: 1000, # max number of words to include in the model + min_length: 1, # min length of words to be included + case_sensitive: true, # how to treat words with different case + tokenizer: /\s+/, # how to tokenize the text, defaults to whitespace + stop_words: ["and", "the"] # words to exclude from the model } } ``` ## Full Example @@ -216,10 +216,10 @@ ```ruby PriceModel.build ``` -This saves the model to `price_model.pmml`. Be sure to check this into source control. +This saves the model to `price_model.pmml`. Check this into source control or use a tool like [Trove](https://github.com/ankane/trove) to store it. Predict with: ```ruby PriceModel.predict(house)