groonga - An open-source fulltext search engine and column store.

News

Release 5.0.3 - 2015-04-29

Improvements

  • [Tokenizers][Regular expression] Skip the last one character token.
  • [mruby] Supported regexp pattern for estimating size.
  • [mruby] Supported size estimation for accessor.
  • [logical_range_filter] Removed GRN_LOGICAL_RANGE_FILTER_ENABLED environment variable which is introduced since Groonga 5.0.2. Use GRN_LOGICAL_RANGE_FILTER_THRESHOLD=0 to disable range index search feature.
  • [logical_range_filter] Supported negative limit and offset.
  • [Windows] Used Groonga Admin in package.
  • [logical_range_filter] Changed threshold meaning:
    • threshold <= 0.0: always use range index
    • threshold >= 1.0: never use range index
  • [dump] Supported plugin.
  • [dump] Added the following options:
    • --dump_plugins [yes(default)/no]
    • --dump_schema [yes(default)/no]
    • --dump_records [yes(default)/no]
    • --dump_indexes [yes(default)/no]
  • [API] Added grn_plugin_get_ruby_suffix().
  • [dump] Fixed order to put index columns after reference columns because index column may refer reference columns.
  • [dump] Don't dump records of lexicon.
  • [dump] Show _id for TABLE_NO_KEY again.
  • [dump] Used offline index construnction.
  • Increased max hash key size from 4KiB (4096Byte) to 64KiB - 1 (65535Byte).
  • Increased max cache key size from 4KiB (4096Byte) to 64KiB - 1 (65535Byte).
  • Improved performance for nested index search.
  • Used index for nonexistent reference column value.
  • [experimental] Added plugin functions/vector. It includes vector_size function.
  • [Windows] Updated Visual Studio version [GitHub groonga/meetup#4] [Reported by Hiroyuki Mizuhara]
  • [cache_limit] Expired old caches when the max N caches is decreased. [Suggested by Gurunavi, Inc.]
  • Show more information such as errno for errors.
  • [windows] Used secure functions on Windows.
  • Added the following APIs to change log rotate threshold in file size.
    • grn_default_logger_set_rotate_threshold_size()
    • grn_default_logger_get_rotate_threshold_size()
    • grn_default_query_logger_set_rotate_threshold_size()
    • grn_default_query_logger_get_rotate_threshold_size()
  • [experimental] Supported log rotation. The feature is disabled by default. You can enable log rotation by the following options:
    • --log-rotate-threshold-size
    • --query-log-rotate-threshold-size
  • [GQTP] Documented about GQTP server.
  • [groonga executable file] Documented groonga executable file partially.
  • Supported Ubuntu 15.04 (Vivid Vervet).
  • Supported Debian 8.0 (Jessie).
  • [groonga-httpd] Updated bundled nginx version to the latest mainline (1.8.0).

Fixes

  • [windows] Fixed a bug that Offline index construction is failed for large data (at least 1GB or larger) with Groonga built by Microsoft Visual C++. [Reported by Hideki ARAI]
  • [mruby] Made \\ index searchable in regular expression.
  • Fixed a bug that GRN_II_CURSOR_SET_MIN_ENABLE=yes doesn't return some matched records.
  • [sharding] Fixed a bug that partial range is handled as all range.
  • [logical_range_filter] Fixed a bug that :order => "descending" doesn't work.
  • [logical_count] Re-supported counting with range index.
  • Fixed a bug causing malfunction of grn_pat_del() and added a test for invalid patricia trie node add case. [groonga-dev,03177] [Reported by yuya sako]

Thanks

  • Hideki ARAI
  • Hiroyuki Mizuhara
  • Gurunavi, Inc.
  • yuya sako

Release 5.0.2 - 2015-03-31

It's a bug fix release of 5.0.1.

Improvements

  • Supported MessagePack 1.0.1. [Reported by Hiroshi Hatake]
  • [logical_range_filter] Disabled range index by default. It's enabled when you set the enviromnent variable GRN_LOGICAL_RANGE_FILTER_ENABLED to yes.

Fixes

  • Fixed a regression bug that JSONP doesn't work. It was introduced in Groonga 4.1.1.
  • [windows] Fixed a bug that crash on x86 for Groonga 5.0.1. [groonga-dev,03131] [Reported by Atsushi Shinoda]
  • Fixed a crash bug that libedit is not properly initialized. The problem is fixed in the environment such as CentOS 7.

Thanks

  • Atsushi Shinoda
  • Hiroshi Hatake

Release 5.0.1 - 2015-03-29

Improvements

  • [logical_range_filter] Supported filter and sort.

  • Supported range search by multiple column index.

  • Added API Overview document for users who want to use Groonga as library.

  • [incompatible] Changed internal type of _score to floating point number from 32bit integer number. This is incompatible change for DB API users. This isn't incompatible change for query API users. It means that users who just use select aren't affected. Use the following code that works with both older and newer Groonga:

    grn_obj *score;
    double score_value;
    
    if (score->header.domain == GRN_DB_FLOAT) {
      score_value = GRN_FLOAT_VALUE(score);
    } else {
      score_value = (double)GRN_INT32_VALUE_FLOAT_VALUE(score);
    }
    
  • [select] Added more strict check for invalid drilldown parameter.

  • Added grn_ctx_get_all_tables(). [Suggested by Masatoshi Teruya]

  • Supported to customize score function. See Scorer for details.

  • [incompatible] Custom score function feature introduced API and ABI incompatibilities in DB API layer. If you're using grn_search_optarg, please check that your code initializes your grn_search_optarg by 0 like the following:

    grn_search_optarg options;
    memset(&options, 0, sizeof(grn_search_optarg));
    

    If your code do the above thing, your code is API compatible and ABI incompatible. You just need to rebuild your code without modification.

    If your code doesn't the above thing, you need to added the above thing to your code.

  • Added the following predicates that check grn_obj type to DB API:

    • grn_obj_is_table()
    • grn_obj_is_proc_proc()
    • grn_obj_is_function_proc()
    • grn_obj_is_selector_proc()
    • grn_obj_is_scorer_proc()
  • [experimental] Supported skipping posting list when searching popular term and rare term at the same time. It will improve performance. Set GRN_II_CURSOR_SET_MIN_ENABLE environment variable to 1 to enable the feature. The feature is disabled by default.

  • [doc] Added in_values document.

  • [doc] Added logical_count document.

  • [mruby] Implemented custom #inspect method. Is is useful for debugging.

  • Added scorer_tf_at_most scorer. It limits not to exceed specified score regardless of term frequency.

  • [mruby] Supported estimating matched records for selecting index search or sequential search.

  • Added the following functions to estimate size by index:

    • grn_expr_estimate_size()
    • grn_ii_estimate_size_for_query()
    • grn_ii_estimate_size_for_lexicon_cursor()
  • Added missing NormalizerAuto availability check. [GitHub#283] [Reported by Tasuku SUENAGA]

  • Dropped Visual Studio 2010 support.

  • [experimental][mecab] Supported chunked tokenization. This feature is a workaround for MeCab's "too long sentense" error. Specify yes to GRN_MECAB_CHUNKED_TOKENIZE_ENABLED environment variable to enable it. By this configuration, Groonga splits a long text (8192 bytes over text by default) into small chunks and passes each chunk to MeCab. By this process, the above error isn't occurred. Additionally, you can customize chunk threshold bytes by GRN_MECAB_CHUNK_SIZE_THRESHOLD environment variable. Note that ,, ., !, ?, U+3001 IDEOGRAPHIC COMMA, U+3002 IDEOGRAPHIC FULL STOP, U+FF01 FULLWIDTH EXCLAMATION MARK and U+FF1F FULLWIDTH QUESTION MARK are treated as chunk delimiter characters.

  • Supported --pid-file in server mode of groonga executable file.

  • [groonga-httpd] Supported graceful stop to clean Groonga. It doesn't terminate the open connections immediately.

  • [experimental] Supported regular expression. See Regular expression to know about how to use regular expression.

  • [experimental] Added plugin_unregister command.

  • [http][load] Added "," as chunk separator in POST data. It decreases internal buffer size and improves load time when POST data don't include any new line.

  • [doc] Added Tokenizers document.

  • Improved POSIX.2 compatibility by using . as bash's "source" command replacement. [GitHub#317] [Patch by Jun Kuriyama]

  • [windows] Changed to the default IO version 1. It reduces disk usage on Windows. [groonga-dev,03118] [Tested by ongaeshi]

  • [httpd] Updated bundled nginx version to the latest mainline (1.7.11).

  • Changed mime-type for TSV output to text/tab-separated-values from text/plain.

  • [TokenFilterStopWord] Supported Offline index construction. [GitHub#296] [Patch by Naoya Murakami]

Fixes

  • Fixed not to use obsolete --address parameter in the default groonga.conf. --bind-address is used instead. [Groonga-talk] [Reported by Dewangga]
  • [truncate] Fixed a bug that TABLE_NO_KEY table can't be truncated.
  • [mecab] Reduced needless logs for "empty token" and "ignore empty token".
  • Fixed a bug that wrong section in index is used. It means that wrong search result is returned. If you satisfy all of the following conditions, this bug is occurred:
    • Multiple indexes are available.
    • The first defined index or the last defined index are multi-column indexes.
    • When both of the first defined index and the last defined index are multi-column indexes, source column orders are different in them.
  • Fixed a bug that passing Groonga command line to groonga executable file from shell command line style usage always returns 0 as exit code. For example, groonga DB_PATH nonexistent_command always returned 0 as exist code.
  • Fixed a bug that plugin path may be broken when two or more plugins registered. [Reported by Naoya Murakami]
  • Fixed a bug that Lexicon.index.source_column_name style in match_columns doesn't work when source column specified by source_column_name has two or more indexes. [Reported by Naoya Murakami]

Thanks

  • Masatoshi Teruya
  • Tasuku SUENAGA
  • Dewangga
  • Jun Kuriyama
  • ongaeshi
  • Naoya Murakami

Release 5.0.0 - 2015-02-09

  • Bump version to 5.0.0!

Improvements

  • [doc] Added Security about Script syntax.
  • [experimental] Added sharding plugin. Execute register sharding to enable this feature in advance, then use logical_count to select records.
  • [cmake] Supported embedded Groonga with Clang. It fixed compilation failure on FreeBSD 10.1. [MDEV-7293] [Reported by Bernard Spil]
  • Supported to customize plugins directory. Set GRN_PLUGINS_DIR environment variable.

Fixes

  • Fixed build failure when system has an incompatible version of onigmo/oniguruma headers installed. [GitHub#276] [Patch by Akinori MUSHA]
  • Fixed time related build failure on MSVC [GitHub#237]

Thanks

  • Akinori MUSHA
  • Bernard Spil

The old releases