README.md in embulk-input-splunk-0.1.1 vs README.md in embulk-input-splunk-0.1.2
- old
+ new
@@ -1,36 +1,104 @@
# Splunk input plugin for Embulk
A simple plug-in to run a once-off Splunk query and emit the results.
+This plugin loads events as two columns: `time` and `event`. `event` is JSON contain the results of your query. You can use filter plugins such as [embulk-filter-expand_json](https://github.com/civitaspo/embulk-filter-expand_json) or [embulk-filter-add_time](https://github.com/treasure-data/embulk-filter-add_time) to convert the json column to typed columns. [Rename filter](http://www.embulk.org/docs/built-in.html#rename-filter-plugin) is also useful to rename the typed columns.
+
+Note that the time is fetched from Splunk's `_time` field. It is possible to rename or reformat this field in the query in a such a way that this plugin will fail or have unexpected results. It is recommended you do not alter the `_time` in the query unless you know what you're doing.
+
## Overview
-* **Plugin type**: input
-* **Resume supported**: no
-* **Cleanup supported**: no
-* **Guess supported**: no
+- **Plugin type**: input
+- **Resume supported**: no
+- **Cleanup supported**: no
+- **Guess supported**: no
## Configuration
-- **type**: splunk
-- **host**: host of your splunk server (string, required)
-- **username**: splunk username (string, required)
-- **password**: splunk password (string, required)
-- **port**: splunk API port (integer, default: 8089)
-- **query**: the query you wish to run. It should be prefixed with "search" (string required)
+- **type**: splunk
+- **scheme**: HTTP scheme for using the Splunk API (string, default: https)
+- **host**: host of your splunk server (string, required)
+- **username**: splunk username (string, required)
+- **password**: splunk password (string, required)
+- **port**: splunk API port (integer, default: 8089)
+- **query**: the query you wish to run. It should be prefixed with "search" (string required)
+- **earliest_time**: the earliest time for the splunk search. (string, default: nil, which is unbounded)
+- **latest_time**: the latest time for the splunk search. (string, default: nil, which is unbounded)
-## Example
+### Earliest and latest times
+Splunk's required data format is `%Y-%m-%dT%H:%M:%S.%L%:z` which is the required format for `earliest_time` and `latest_time`.
+In addition, Splunk relative time operations are also accepted, such as -1d@d. For more information, see the [Splunk documentation](https://docs.splunk.com/Documentation/Splunk/7.0.2/SearchReference/SearchTimeModifiers)
+
+### Number of returned results
+
+The default Splunk API limits resuts to 100. In this plugin, the limit is not set, so it is possible to generate very large result sets. To limit the number of results, use the `head` or `tail` command in your query.
+
+
+## Examples
+
+Remember the queries much be prefixed with the search command or they are unlikely not to work.
+
+### Unbounded time range
+
```yaml
in:
type: splunk
host: splunk.example.com
username: splunk_user
password: abc123
port: 8089
- query: "search index="main" | head 10"
-out:
- type: stdout
+ query: search index="main"
+```
+
+### Relative time range
+
+```yaml
+in:
+ type: splunk
+ host: splunk.example.com
+ username: splunk_user
+ password: abc123
+ port: 8089
+ query: search index="main"
+ earliest_time: -1m@m
+```
+
+### Absolute time range
+
+```yaml
+in:
+ type: splunk
+ host: splunk.example.com
+ username: splunk_user
+ password: abc123
+ port: 8089
+ query: search index="main"
+ earliest_time: 2017-01-18T19:23:08.237+11:00
+ latest_time: 2018-01-18T19:23:08.237+11:00
+```
+
+### Complex Searches
+
+For those unfamiliar with YAML, the pipe (|) indicates a multiline value. In Splunk the pipe operator is used for creating multi-step processing.
+
+For non-trivial Splunk queries, you should leverage the YAML pipe alongside Splunk pipes for easier to read queries.
+
+```yaml
+in:
+ type: splunk
+ host: splunk.example.com
+ username: splunk_user
+ password: abc123
+ port: 8089
+ query: |
+ search index="main" |
+ eval foo=bar |
+ where like(bar, "%baz%" |
+ head 100
+ earliest_time: 2017-01-18T19:23:08.237+11:00
+ latest_time: 2018-01-18T19:23:08.237+11:00
```
## Build