:plugin: useragent
:type: filter
///////////////////////////////////////////
START - GENERATED VARIABLES, DO NOT EDIT!
///////////////////////////////////////////
:version: %VERSION%
:release_date: %RELEASE_DATE%
:changelog_url: %CHANGELOG_URL%
:include_path: ../../../../logstash/docs/include
///////////////////////////////////////////
END - GENERATED VARIABLES, DO NOT EDIT!
///////////////////////////////////////////
[id="plugins-{type}s-{plugin}"]
=== Useragent filter plugin
include::{include_path}/plugin_header.asciidoc[]
==== Description
Parses user agent strings into structured data based on BrowserScope data.
The filter adds information about the user agent, such as family, operating
system, version, and device.

Logstash releases ship with the `regexes.yaml` database made available by
ua-parser under an Apache 2.0 license. For more details, see the ua-parser
project.
[id="plugins-{type}s-{plugin}-options"]
==== Useragent Filter Configuration Options
This plugin supports the following configuration options plus the <<plugins-{type}s-{plugin}-common-options>> described later.
[cols="<,<,<",options="header",]
|=======================================================================
|Setting |Input type|Required
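| <<plugins-{type}s-{plugin}-lru_cache_size>> |<<number,number>>|No
| <<plugins-{type}s-{plugin}-prefix>> |<<string,string>>|No
| <<plugins-{type}s-{plugin}-regexes>> |<<string,string>>|No
| <<plugins-{type}s-{plugin}-source>> |<<string,string>>|Yes
| <<plugins-{type}s-{plugin}-target>> |<<string,string>>|No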
|=======================================================================
Also see <<plugins-{type}s-{plugin}-common-options>> for a list of options supported by all
filter plugins.
[id="plugins-{type}s-{plugin}-lru_cache_size"]
===== `lru_cache_size`
* Value type is <<number,number>>
* Default value is `1000`

Parsing user agent strings is expensive. This filter uses an LRU cache to take advantage of the fact that
user agents often appear adjacent to one another in log files and are rarely randomly distributed.
The higher you set this value, the more likely an item is to be in the cache and the faster the filter runs.
However, setting it too high can use more memory than desired.
Experiment with different values to find the best performance for your dataset.

This value MUST be greater than 0. There is no practical reason to disable the cache: its overhead is minimal
and the speed gains are large.

This setting is global: all instances of the useragent filter
share the same cache, and the last declared cache size wins. Separate caches for different instances
at different points in the pipeline would bring no benefit. They would only increase the
number of cache misses and waste memory.
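
As a rough illustration of raising the cache size, a pipeline might look like the sketch below. The field name `agent` and the value `5000` are assumptions chosen for the example, not recommendations; a suitable size depends on your data.

```logstash
filter {
  useragent {
    # "agent" is a hypothetical field that holds the raw user agent string
    source => "agent"
    # Illustrative value only; must be greater than 0
    lru_cache_size => 5000
  }
}
```

Because the cache is shared, declaring a different `lru_cache_size` in another useragent block would override this value rather than create a second cache.
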
[id="plugins-{type}s-{plugin}-prefix"]
===== `prefix`
* Value type is <<string,string>>
* Default value is `""`

A string to prepend to all of the extracted keys.
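
A minimal sketch of this option, assuming the user agent string lives in a hypothetical field named `agent` and that `ua_` is the prefix you want:

```logstash
filter {
  useragent {
    # "agent" is a hypothetical field that holds the raw user agent string
    source => "agent"
    # Every extracted key is written with "ua_" prepended to its name
    prefix => "ua_"
  }
}
```

A prefix like this can help keep the extracted keys from colliding with fields that already exist on the event.
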
[id="plugins-{type}s-{plugin}-regexes"]
===== `regexes`
* Value type is <<string,string>>
* There is no default value for this setting.

Path to the `regexes.yaml` file to use.

If not specified, the filter uses the `regexes.yaml` that ships
with Logstash.

The latest version of this file is available from the ua-parser project.
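
For example, to point the filter at a newer copy of the database that you have downloaded yourself, a configuration along these lines could be used. The path and the field name `agent` are placeholders for illustration.

```logstash
filter {
  useragent {
    # "agent" is a hypothetical field that holds the raw user agent string
    source => "agent"
    # Hypothetical location of a custom regexes.yaml file
    regexes => "/etc/logstash/regexes.yaml"
  }
}
```
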
[id="plugins-{type}s-{plugin}-source"]
===== `source`
* This is a required setting.
* Value type is <<string,string>>
* There is no default value for this setting.
The field containing the user agent string. If this field is an
array, only the first value will be used.
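
The smallest useful configuration sets only this option. The sketch below assumes the user agent string was previously stored in a field named `agent`, which is a hypothetical name; substitute whichever field your pipeline actually populates.

```logstash
filter {
  useragent {
    # "agent" is a hypothetical field that holds the raw user agent string
    source => "agent"
  }
}
```
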
[id="plugins-{type}s-{plugin}-target"]
===== `target`
* Value type is <<string,string>>
* There is no default value for this setting.

The name of the field to assign user agent data into.

If not specified, user agent data is stored in the root of the event.
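
To keep the parsed data out of the root of the event, a target field can be named as in the following sketch. Both `agent` and `user_agent` are hypothetical field names used only for illustration.

```logstash
filter {
  useragent {
    # "agent" is a hypothetical field that holds the raw user agent string
    source => "agent"
    # Extracted data is assigned into this field instead of the event root
    target => "user_agent"
  }
}
```
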
[id="plugins-{type}s-{plugin}-common-options"]
include::{include_path}/{type}.asciidoc[]