# SPARQL for RDF.rb

This is a [Ruby][] implementation of [SPARQL][] for [RDF.rb][].

[![Gem Version](https://badge.fury.io/rb/sparql.png)](https://badge.fury.io/rb/sparql)
[![Build Status](https://github.com/ruby-rdf/sparql/workflows/CI/badge.svg?branch=develop)](https://github.com/ruby-rdf/sparql/actions?query=workflow%3ACI)
[![Coverage Status](https://coveralls.io/repos/ruby-rdf/sparql/badge.svg?branch=develop)](https://coveralls.io/r/ruby-rdf/sparql?branch=develop)
[![Gitter chat](https://badges.gitter.im/ruby-rdf/rdf.png)](https://gitter.im/ruby-rdf/rdf)

## Features

* 100% free and unencumbered [public domain](https://unlicense.org/) software.
* Complete [SPARQL 1.1 Query][] parsing and execution
* SPARQL results as [XML][SPARQL XML], [JSON][SPARQL JSON], [CSV][SPARQL 1.1 Query Results CSV and TSV Formats], [TSV][SPARQL 1.1 Query Results CSV and TSV Formats] or HTML.
* SPARQL CONSTRUCT or DESCRIBE serialized based on Format, Extension or MIME Type using available RDF Writers (see [Linked Data][])
* SPARQL Client for accessing remote SPARQL endpoints.
* SPARQL Update
* [Rack][] and [Sinatra][] middleware to perform [HTTP content negotiation][conneg] for result formats
* Compatible with any [Rack][] or [Sinatra][] application and any Rack-based framework.
* Helper method for describing [SPARQL Service Description][SSD]
* Implementation Report: {file:etc/earl.html EARL}
* Compatible with Ruby >= 2.6.
* Supports Unicode query strings on all versions of Ruby.
* Provisional support for [SPARQL-star][].

## Description

The {SPARQL} gem implements [SPARQL 1.1 Query][] and [SPARQL 1.1 Update][], and provides [Rack][] and [Sinatra][] middleware to provide results using [HTTP Content Negotiation][conneg].

* {SPARQL::Grammar} implements a [SPARQL 1.1 Query][] and [SPARQL 1.1 Update][] parser generating [SPARQL S-Expressions (SSE)][SSE].
* {SPARQL::Algebra} executes SSE against any `RDF::Graph` or `RDF::Repository`, including compliant [RDF.rb][] repository adaptors such as [RDF::DO][] and [RDF::Mongo][].
* {Rack::SPARQL} and {Sinatra::SPARQL} provide middleware components to format results using an appropriate format based on [HTTP content negotiation][conneg].

### [SPARQL 1.1 Query][] Extensions and Limitations

The {SPARQL} gem uses the [SPARQL 1.1 Query][] {file:etc/sparql11.html EBNF grammar}, which provides much more capability than [SPARQL 1.0][], but has a few limitations:

* The format for decimal datatypes has changed in [RDF 1.1][]; they may no longer have a trailing ".", although they do not need a leading digit.
* BNodes may now include extended characters, including ".".

The SPARQL gem now implements the following [SPARQL 1.1 Query][] operations:

* [Functions](https://www.w3.org/TR/sparql11-query/#SparqlOps)
* [BIND](https://www.w3.org/TR/sparql11-query/#bind)
* [GROUP BY](https://www.w3.org/TR/sparql11-query/#groupby)
* [Aggregates](https://www.w3.org/TR/sparql11-query/#aggregates)
* [Subqueries](https://www.w3.org/TR/sparql11-query/#subqueries)
* [Inline Data](https://www.w3.org/TR/2013/REC-sparql11-query-20130321/#inline-data)
* [Exists](https://www.w3.org/TR/2013/REC-sparql11-query-20130321/#func-filter-exists)
* [Negation](https://www.w3.org/TR/2013/REC-sparql11-query-20130321/#negation)
* [Property Paths](https://www.w3.org/TR/2013/REC-sparql11-query-20130321/#propertypaths)

The gem also includes the following [SPARQL 1.1 Update][] operations:

* [Graph Update](https://www.w3.org/TR/sparql11-update/#graphUpdate)
* [Graph Management](https://www.w3.org/TR/sparql11-update/#graphManagement)

Not supported:

* [Federated Query][SPARQL 1.1 Federated Query],
* [Entailment Regimes][SPARQL 1.1 Entailment Regimes],
* [Protocol][SPARQL 1.1 Protocol], and
* [Graph Store HTTP Protocol][SPARQL 1.1 Graph Store HTTP Protocol]

either in this, or related gems.

### Updates for RDF 1.1

Starting with version 1.1.2, the SPARQL gem uses version 1.1 of [RDF.rb][], which adheres to [RDF 1.1 Concepts](https://www.w3.org/TR/rdf11-concepts/) rather than [RDF 1.0](https://www.w3.org/TR/rdf-concepts/). The main difference is that there is now no difference between a _Simple Literal_ (a literal with no datatype or language) and a Literal with datatype _xsd:string_; this causes some minor differences in the way queries are understood, and in the results they are expected to produce.

Additionally, queries now take a block, or return an `Enumerator`; this is in keeping with much of the behavior of [RDF.rb][] methods, including `Queryable#query`, and with version 1.1 of [RDF.rb][], `Query#execute`. As a consequence, all queries which used to be of the form `query.execute(repository)` may equally be called as `repository.query(query)`. Previously, results were returned as a concrete class implementing `RDF::Queryable` or `RDF::Query::Solutions`; these are now `Enumerators`.

### SPARQL Extension Functions

Extension functions may be defined, which will be invoked during query evaluation. For example:

    # Register a function using the IRI
    crypt_iri = RDF::URI("https://rubygems#crypt")
    SPARQL::Algebra::Expression.register_extension(crypt_iri) do |literal|
      raise TypeError, "argument must be a literal" unless literal.literal?
      # String#crypt requires a salt argument; "salt" is only a placeholder
      RDF::Literal(literal.to_s.crypt("salt"))
    end

Then, use the function in a query:

    PREFIX rsp:
    PREFIX schema:
    SELECT ?crypted
    {
      [ schema:email ?email]
      BIND(rsp:crypt(?email) AS ?crypted)
    }

See {SPARQL::Algebra::Expression.register_extension} for details.

### SPARQLStar (SPARQL-star)

The gem supports [SPARQL-star][], where patterns may include sub-patterns recursively, for a kind of reification.

For example, the following Turtle* file uses a statement as the subject of another statement:

    @prefix : .
    @prefix foaf: .
    @prefix ex: .

    :bob foaf:name "Bob" .
    <<:bob foaf:age 23>> ex:certainty 0.9 .

This can be queried using the following query:

    PREFIX :
    PREFIX foaf:
    PREFIX ex:

    SELECT ?age ?c WHERE {
      ?bob foaf:name "Bob" .
      <> ex:certainty ?c .
    }

This treats `<<:bob foaf:age 23>>` as a subject resource, and uses the pattern `<>` to match that resource and bind the associated variables.

**Note: This feature is subject to change or elimination as the standards process progresses.**

#### BIND

There is an alternate syntax using the `BIND` operator:

    PREFIX :
    PREFIX foaf:
    PREFIX dct:

    SELECT ?a ?b ?c WHERE {
      ?bob foaf:name "Bob" .
      BIND( <> AS ?a ) .
      ?t ?b ?c .
    }

When binding, the triple can be either in Property Graph (`:PG`) or Separate Assertions (`:SA`) mode, because the query matches the pattern as a subject (or object), and the triple does not need to be specifically asserted in the graph. When parsing in Property Graph mode, such triples will also be added to the enclosing graph. Thus, querying for `<>` and `?bob foaf:age ?age` may not produce the same results.

When binding an embedded triple to a variable, it is the matched triples which are bound, not the pattern. Thus, the example above with `SELECT ?a ?b ?c` would end up binding `?a` to `:bob foaf:name 23`.

#### Construct

As well as a `CONSTRUCT`:

    PREFIX :
    PREFIX foaf:
    PREFIX dct:

    CONSTRUCT {
      ?bob foaf:name "Bob" .
      <> ?b ?c .
    }
    WHERE {
      ?bob foaf:name "Bob" .
      <> ?b ?c .
    }

Note that results can be serialized only when the format supports [RDF-star][].

#### SPARQL results

The SPARQL results formats are extended to serialize embedded triples as described for [RDF4J](https://rdf4j.org/documentation/programming/rdfstar/):

    {
      "head" : {
        "vars" : [ "a", "b", "c" ]
      },
      "results" : {
        "bindings": [
          {
            "a" : {
              "type" : "triple",
              "value" : {
                "s" : { "type" : "uri", "value" : "http://example.org/bob" },
                "p" : { "type" : "uri", "value" : "http://xmlns.com/foaf/0.1/name" },
                "o" : {
                  "datatype" : "http://www.w3.org/2001/XMLSchema#integer",
                  "type" : "literal",
                  "value" : "23"
                }
              }
            },
            "b": { "type": "uri", "value": "http://example.org/certainty" },
            "c" : {
              "datatype" : "http://www.w3.org/2001/XMLSchema#decimal",
              "type" : "literal",
              "value" : "0.9"
            }
          }
        ]
      }
    }

### Middleware

{Rack::SPARQL} is a superset of [Rack::LinkedData][] that allows content-negotiated results to be returned for any `RDF::Enumerable` or an enumerator extended with `RDF::Query::Solutions` compatible results. You would typically return an instance of `RDF::Graph`, `RDF::Repository` or an enumerator extended with `RDF::Query::Solutions` from your Rack application, and let the `Rack::SPARQL::ContentNegotiation` middleware take care of serializing your response into whatever format the HTTP client requested and understands.

{Sinatra::SPARQL} is a thin Sinatra-specific wrapper around the {Rack::SPARQL} middleware, which implements SPARQL content negotiation for Rack applications. {Sinatra::SPARQL} also supports [SPARQL 1.1 Service Description][].

The middleware queries [RDF.rb][] for the MIME content types of known RDF serialization formats, so it will work with whatever serialization extensions are currently available for RDF.rb. (At present, this includes support for N-Triples, N-Quads, Turtle, RDF/XML, RDF/JSON, JSON-LD, RDFa, TriG and TriX.)

### Remote datasets

A SPARQL query containing `FROM` or `FROM NAMED` (also `USING` or `USING NAMED` in an update) will load the referenced IRI unless the repository already contains a graph with that same IRI.
This is performed using [RDF.rb][] `RDF::Util::File.open_file` passing HTTP Accept headers for various available RDF formats. For best results, require [Linked Data][] to enable a full set of RDF formats in the `GET` request. Also, consider overriding `RDF::Util::File.open_file` with an implementation that supports HTTP `GET` headers (such as `Net::HTTP`).

Queries using datasets are re-written to use the identified graphs for `FROM` and `FROM NAMED` by filtering the results, allowing the use of a repository that contains many graphs without confusing information.

### Result formats

`SPARQL.serialize_results` may be used on its own, or in conjunction with {Rack::SPARQL} or {Sinatra::SPARQL} to provide content-negotiated query results. For basic `SELECT` and `ASK` this includes HTML, XML, CSV, TSV and JSON formats.

`DESCRIBE` and `CONSTRUCT` create an `RDF::Graph`, which can be serialized through [HTTP Content Negotiation][conneg] using available RDF writers. For best results, require [Linked Data][] to enable a full set of RDF formats.

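As a rough illustration of the content negotiation described above for `SELECT` and `ASK` results, the following sketch picks a result format from an HTTP `Accept` header. It is not the gem's implementation: `RESULT_FORMATS`, `negotiate_result_format` and the `:xml` default are hypothetical names and choices made for this example. It uses only plain Ruby and ignores wildcards such as `*/*`.

```ruby
# Hypothetical helper, NOT part of the sparql gem: map the media types of the
# SPARQL result formats to short format symbols.
RESULT_FORMATS = {
  "application/sparql-results+json" => :json,
  "application/sparql-results+xml"  => :xml,
  "text/csv"                        => :csv,
  "text/tab-separated-values"       => :tsv,
  "text/html"                       => :html
}.freeze

# Choose a format from an Accept header value such as
# "text/html;q=0.5, application/sparql-results+json".
# The :xml default is an assumption made for this sketch.
def negotiate_result_format(accept, default: :xml)
  # Split the header into [media type, quality, position] triples.
  candidates = accept.to_s.split(",").each_with_index.map do |entry, index|
    mime, *params = entry.strip.split(";").map(&:strip)
    q = params.map { |param| param[/\Aq=(.+)\z/, 1] }.compact.first
    [mime.to_s.downcase, q ? q.to_f : 1.0, index]
  end

  # Highest quality first; ties keep the order given by the client.
  candidates.sort_by { |_, q, index| [-q, index] }.each do |mime, q, _|
    next if q <= 0 # q=0 means "not acceptable"
    return RESULT_FORMATS[mime] if RESULT_FORMATS.key?(mime)
  end

  default
end

puts negotiate_result_format("text/html;q=0.5, application/sparql-results+json")
```

In an application, {Rack::SPARQL} or {Sinatra::SPARQL} perform this selection for you; the sketch only shows the idea of ranking the client's media types by their `q` values.
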
## Examples

    require 'rubygems'
    require 'sparql'

### Querying a repository with a SPARQL query

    queryable = RDF::Repository.load("etc/doap.ttl")
    query = SPARQL.parse("SELECT * WHERE { ?s ?p ?o }")
    queryable.query(query) do |result|
      puts result.inspect
    end

### Executing a SPARQL query against a repository

    queryable = RDF::Repository.load("etc/doap.ttl")
    query = SPARQL.parse("SELECT * WHERE { ?s ?p ?o }")
    query.execute(queryable) do |result|
      puts result.inspect
    end

### Updating a repository

    queryable = RDF::Repository.load("etc/doap.ttl")
    update = SPARQL.parse(%(
      PREFIX doap:
      INSERT DATA { doap:implements }
    ), update: true)
    update.execute(queryable)

### Rendering solutions as JSON, XML, CSV, TSV or HTML

    queryable = RDF::Repository.load("etc/doap.ttl")
    solutions = SPARQL.execute("SELECT * WHERE { ?s ?p ?o }", queryable)
    solutions.to_json #to_xml #to_csv #to_tsv #to_html

### Parsing a SPARQL query string to SSE

    query = SPARQL.parse("SELECT * WHERE { ?s ?p ?o }")
    query.to_sxp #=> (bgp (triple ?s ?p ?o))

### Parsing an SSE to a SPARQL query or update string

    # Note: if the SSE uses extension functions, they either must be XSD casting functions, or custom functions which are registered extensions. (See [SPARQL Extension Functions](#sparql-extension-functions))

    query = SPARQL::Algebra.parse(%{(bgp (triple ?s ?p ?o))})
    sparql = query.to_sparql #=> "SELECT * WHERE { ?s ?p ?o }"

### Command line processing

    sparql execute --dataset etc/doap.ttl etc/from_default.rq
    sparql execute -e "SELECT * FROM WHERE { ?s ?p ?o }"

    # Generate SPARQL Algebra Expression (SSE) format
    sparql parse etc/input.rq
    sparql parse -e "SELECT * WHERE { ?s ?p ?o }"

    # Generate SPARQL Query from SSE
    sparql parse --sse etc/input.sse --format sparql
    sparql parse --sse --format sparql -e "(dataset () (bgp (triple ?s ?p ?o)))"

    # Run query using SSE input
    sparql execute --dataset etc/doap.ttl --sse etc/input.sse
    sparql execute --sse -e "(dataset () (bgp (triple ?s ?p ?o)))"

    # Run a local SPARQL server using a dataset
    sparql server etc/doap.ttl

### Adding SPARQL content negotiation to a Rails 3.x application

    # config/application.rb
    require 'rack/sparql'

    class Application < Rails::Application
      config.middleware.use Rack::SPARQL::ContentNegotiation
    end

### Adding SPARQL content negotiation to a Rackup application

    #!/usr/bin/env rackup
    require 'rack/sparql'

    repository = RDF::Repository.new do |graph|
      graph << [RDF::Node.new, RDF::Vocab::DC.title, "Hello, world!"]
    end
    results = SPARQL.execute("SELECT * WHERE { ?s ?p ?o }", repository)

    use Rack::SPARQL::ContentNegotiation
    run lambda { |env| [200, {}, results] }

### Adding SPARQL content negotiation to a classic Sinatra application

    # Sinatra example
    #
    # Call as http://localhost:4567/sparql?query=uri,
    # where `uri` is the URI of a SPARQL query, or
    # a URI-escaped SPARQL query, for example:
    #   http://localhost:4567/?query=SELECT%20?s%20?p%20?o%20WHERE%20%7B?s%20?p%20?o%7D
    require 'sinatra'
    require 'sinatra/sparql'
    require 'uri'

    get '/' do
      settings.sparql_options.replace(standard_prefixes: true)
      repository = RDF::Repository.new do |graph|
        graph << [RDF::Node.new, RDF::Vocab::DC.title, "Hello, world!"]
      end
      if params["query"]
        # URI.decode was removed in Ruby 3.0; decode_www_form_component is used instead
        query = params["query"].to_s.match(/^http:/) ? RDF::Util::File.open_file(params["query"]) : ::URI.decode_www_form_component(params["query"].to_s)
        SPARQL.execute(query, repository)
      else
        settings.sparql_options.merge!(prefixes: {
          ssd: "http://www.w3.org/ns/sparql-service-description#",
          void: "http://rdfs.org/ns/void#"
        })
        service_description(repo: repository)
      end
    end

Find more examples in {SPARQL::Grammar} and {SPARQL::Algebra}.

## Documentation

Full documentation available on [Rubydoc.info][SPARQL doc]

### Principal Classes

* {SPARQL}
  * {SPARQL::Algebra}
    * {SPARQL::Algebra::Expression}
    * {SPARQL::Algebra::Query}
    * {SPARQL::Algebra::Operator}
  * {SPARQL::Grammar}
    * {SPARQL::Grammar::Parser}
* {Sinatra::SPARQL}
* {Rack::SPARQL}
  * {Rack::SPARQL::ContentNegotiation}

## Dependencies

* [Ruby](https://ruby-lang.org/) (>= 2.6)
* [RDF.rb](https://rubygems.org/gems/rdf) (~> 3.2)
* [SPARQL::Client](https://rubygems.org/gems/sparql-client) (~> 3.1)
* [SXP](https://rubygems.org/gems/sxp) (~> 1.2)
* [Builder](https://rubygems.org/gems/builder) (~> 3.2)
* [JSON](https://rubygems.org/gems/json) (~> 2.6)
* Soft dependency on [Linked Data][] (>= 3.1)
* Soft dependency on [Nokogiri](https://rubygems.org/gems/nokogiri) (~> 1.12)
  Falls back to REXML for XML parsing and Builder for XML serializing. Nokogiri is much more efficient.
* Soft dependency on [Equivalent XML](https://rubygems.org/gems/equivalent-xml) (>= 0.6)
  Equivalent XML performs more efficient comparisons of XML Literals when Nokogiri is included.
* Soft dependency on [Rack][] (~> 2.2)
* Soft dependency on [Sinatra][] (~> 2.1)

## Installation

The recommended installation method is via [RubyGems](https://rubygems.org/).
To install the latest official release of the `SPARQL` gem, do:

    % [sudo] gem install sparql

## Download

To get a local working copy of the development repository, do:

    % git clone https://github.com/ruby-rdf/sparql.git

## Mailing List

*

## Authors

* [Gregg Kellogg](https://github.com/gkellogg) -
* [Arto Bendiken](https://github.com/artob) -
* [Pius Uzamere](https://github.com/pius) -

## Contributing

This repository uses [Git Flow](https://github.com/nvie/gitflow) to manage development and release activity. All submissions _must_ be on a feature branch based on the _develop_ branch to ease staging and integration.

* Do your best to adhere to the existing coding conventions and idioms.
* Don't use hard tabs, and don't leave trailing whitespace on any line.
* Do document every method you add using [YARD][] annotations. Read the [tutorial][YARD-GS] or just look at the existing code for examples.
* Don't touch the `.gemspec`, `VERSION` or `AUTHORS` files. If you need to change them, do so on your private branch only.
* Do feel free to add yourself to the `CREDITS` file and the corresponding list in the `README`. Alphabetical order applies.
* Do note that in order for us to merge any non-trivial changes (as a rule of thumb, additions larger than about 15 lines of code), we need an explicit [public domain dedication][PDD] on record from you, which you will be asked to agree to on the first commit to a repo within the organization. Note that the agreement applies to all repos in the [Ruby RDF](https://github.com/ruby-rdf/) organization.

## License

This is free and unencumbered public domain software. For more information, see or the accompanying {file:UNLICENSE} file.

A copy of the [SPARQL EBNF][] and derived parser files are included in the repository, which are not covered under the UNLICENSE. These files are covered via the [W3C Document License](https://www.w3.org/Consortium/Legal/2002/copyright-documents-20021231).

A copy of the [SPARQL 1.0 tests][] and [SPARQL 1.1 tests][] are also included in the repository, which are not covered under the UNLICENSE; see the references for test copyright information.

[Ruby]:             https://ruby-lang.org/
[RDF]:              https://www.w3.org/RDF/
[RDF::DO]:          https://rubygems.org/gems/rdf-do
[RDF::Mongo]:       https://rubygems.org/gems/rdf-mongo
[Rack::LinkedData]: https://rubygems.org/gems/rack-linkeddata
[YARD]:             https://yardoc.org/
[YARD-GS]:          https://rubydoc.info/docs/yard/file/docs/GettingStarted.md
[PDD]:              https://unlicense.org/#unlicensing-contributions
[SPARQL]:           https://en.wikipedia.org/wiki/SPARQL
[SPARQL 1.0]:       https://www.w3.org/TR/rdf-sparql-query/
[SPARQL 1.0 tests]: https://www.w3.org/2001/sw/DataAccess/tests/
[SPARQL 1.1 tests]: https://www.w3.org/2009/sparql/docs/tests/
[SSE]:              https://jena.apache.org/documentation/notes/sse.html
[SXP]:              https://www.rubydoc.info/github/dryruby/sxp
[grammar]:          https://www.w3.org/TR/sparql11-query/#grammar
[RDF 1.1]:          https://www.w3.org/TR/rdf11-concepts
[RDF.rb]:           https://rubydoc.info/github/ruby-rdf/rdf
[RDF-star]:         https://w3c.github.io/rdf-star/rdf-star-cg-spec.html
[SPARQL-star]:      https://w3c.github.io/rdf-star/rdf-star-cg-spec.html#sparql-query-language
[Linked Data]:      https://rubygems.org/gems/linkeddata
[SPARQL doc]:       https://rubydoc.info/github/ruby-rdf/sparql/frames
[SPARQL XML]:       https://www.w3.org/TR/rdf-sparql-XMLres/
[SPARQL JSON]:      https://www.w3.org/TR/rdf-sparql-json-res/
[SPARQL EBNF]:      https://www.w3.org/TR/sparql11-query/#sparqlGrammar
[SSD]:              https://www.w3.org/TR/sparql11-service-description/
[Rack]:             https://rack.github.io
[Sinatra]:          https://www.sinatrarb.com/
[conneg]:           https://en.wikipedia.org/wiki/Content_negotiation

[SPARQL 1.1 Query]:                             https://www.w3.org/TR/sparql11-query/
[SPARQL 1.1 Update]:                            https://www.w3.org/TR/sparql11-update/
[SPARQL 1.1 Service Description]:               https://www.w3.org/TR/sparql11-service-description/
[SPARQL 1.1 Federated Query]:                   https://www.w3.org/TR/sparql11-federated-query/
[SPARQL 1.1 Query Results JSON Format]:         https://www.w3.org/TR/sparql11-results-json/
[SPARQL 1.1 Query Results CSV and TSV Formats]: https://www.w3.org/TR/sparql11-results-csv-tsv/
[SPARQL Query Results XML Format]:              https://www.w3.org/TR/rdf-sparql-XMLres/
[SPARQL 1.1 Entailment Regimes]:                https://www.w3.org/TR/sparql11-entailment/
[SPARQL 1.1 Protocol]:                          https://www.w3.org/TR/sparql11-protocol/
[SPARQL 1.1 Graph Store HTTP Protocol]:         https://www.w3.org/TR/sparql11-http-rdf-update/