= rlibsphinxclient
A Ruby wrapper for pure C searchd client API library. This is *highly experimental* library
so use it at your own risk.
== Installing the rlibsphinxclient gem
This gem can be more difficult to install than the typical Ruby extension. First you have to
install Sphinx and Sphinx pure C searchd client API library.
=== Step 1: Install pure C Sphinx client API
Go to http://sphinxsearch.com/downloads.html and download the latest stable release.
Then go to api/libsphinxclient directory and install client API to your
preferred folder (I like /opt/sphinx):
cd api/libsphinxclient
./configure --prefix=/opt/sphinx
make
sudo make install
On Max OS X you may get the following error:
configure: error: C++ preprocessor "/lib/cpp" fails sanity check
In this case you should specify environment variable for ./configure script:
CXXCPP="gcc -E" ./configure --prefix=/opt/sphinx
=== Step 2: Install rlibsphinxclient gem
If you have installed the Sphinx to /opt/sphinx, just run:
sudo gem install kpumuk-rlibsphinxclient --no-ri --no-rdoc
Otherwise, specify where sphinx has been installed to:
sudo gem install kpumuk-rlibsphinxclient --no-ri --no-rdoc -- --with-libsphinxclient-dir=/opt/sphinx-0.9.9
On Mac OS X with MacPorts you should specify ARCHFLAGS environment variable:
sudo env ARCHFLAGS="-arch i386" gem install kpumuk-rlibsphinxclient --no-rdoc --no-ri -- --with-libsphinxclient-dir=/opt/sphinx-0.9.9
If you are working on Ruby on Rails application, you can add gem dependency to your
config/environment.rb:
config.gem 'kpumuk-rlibsphinxclient', :lib => 'sphinx'
Also don't forget to remove the sphinx plugin, because it's functionality
is completely covered by this gem.
== Using the rlibsphinxclient gem
The gem includes two versions of the client API: pure Ruby and wrapper for pure C client API.
They are 100% equivalent in use, so you can switch to any of them. To use pure Ruby client,
instantiate the Sphinx::Client, for pure C wrapper use Sphinx::FastClient.
Important note: you should call destroy method when you do not need client API any more.
The reason for that is the C wrapper saves all query results in memory, and frees them in
the destroy method call. You can omit this call in pure Ruby library, but I'd like
to do call in any case just for consistence (to be able to switch to another client).
Important note #2: to ensure that destroy method will be called, use ensure
block:
begin
@sphinx = Sphinx::FastClient.new
@sphinx.Query('test')
ensure
@sphinx.destroy
end
== Examples of usage
Ok, let's take a look at the examples. First, here is the search example with all possible
filters and options set:
require 'sphinx'
@sphinx = Sphinx::FastClient.new
@sphinx.SetServer('localhost', 3312)
@sphinx.SetLimits(1, 100, 20, 30)
@sphinx.SetMaxQueryTime(5)
@sphinx.SetMatchMode(Sphinx::Client::SPH_MATCH_EXTENDED2)
@sphinx.SetRankingMode(Sphinx::Client::SPH_RANK_BM25)
@sphinx.SetSortMode(Sphinx::Client::SPH_SORT_RELEVANCE)
@sphinx.SetFieldWeights('group_id' => 10, 'rating' => 20)
@sphinx.SetIndexWeights('test1' => 20, 'test2' => 30)
@sphinx.SetIDRange(1, 100)
@sphinx.SetFilter('group_id', [1], true)
@sphinx.SetFilterRange('group_id', 1, 2, true)
@sphinx.SetFilterFloatRange('rating', 1, 3, true)
@sphinx.SetGroupBy('created_at', Sphinx::Client::SPH_GROUPBY_DAY)
@sphinx.SetGroupDistinct('group_id')
@sphinx.SetRetries(5, 10)
results = @sphinx.Query('test')
@sphinx.destroy
BuildKeywords example:
require 'sphinx'
@sphinx = Sphinx::FastClient.new
results = @sphinx.BuildKeywords('wifi gprs', 'test1', true)
@sphinx.destroy
BuildExcerpts example:
require 'sphinx'
@sphinx = Sphinx::FastClient.new
results = @sphinx.BuildExcerpts(['what the world', 'London is the capital of Great Britain'], 'test1', 'the')
@sphinx.destroy
UpdateAttributes example:
require 'sphinx'
@sphinx = Sphinx::FastClient.new
results = @sphinx.UpdateAttributes('test1', ['group_id'], { 2 => [1] })
@sphinx.destroy
== Benchmarks
The reason to write this gem was to investigate why we keep getting timeout errors
when using Sphinx (occur rarely, but they are annoying me.) But the side effect of
this library was the slight search performance improvement: Ruby library is slower
when generating Sphinx request and parsing its results.
require 'sphinx'
require 'benchmark'
def run_test(klass)
sphinx = klass.new
sphinx.Query('test hello')
ensure
sphinx.destroy
end
Benchmark.bm do |x|
x.report('pure ruby') { 1000.times { run_test(Sphinx::Client) } }
x.report('c wrapper') { 1000.times { run_test(Sphinx::FastClient) } }
end
On my MBP I got the following results:
user system total real
pure ruby 0.420000 0.230000 0.650000 ( 14.721659)
c wrapper 0.060000 0.090000 0.150000 ( 2.248645)
== Who are the authors?
This plugin has been created in Scribd.com for our internal use and then the sources were opened
for other people to use. All the code in this package has been developed by Dmytro Shteflyuk
for Scribd.com and is released under the GPLv2 license. For more details, see LICENSE file.