Skip to main content

Getting Started

Install

The easiest way to install Bento is with this handy script:

curl -Lsf https://warpstreamlabs.github.io/bento/sh/install | bash

Or you can grab an archive containing Bento from the releases page.

Docker

If you have docker installed you can pull the latest official Bento image with:

docker pull ghcr.io/warpstreamlabs/bento
docker run --rm -v /path/to/your/config.yaml:/bento.yaml ghcr.io/warpstreamlabs/bento

Serverless

For information about serverless deployments of Bento check out the serverless section here.

Run

A Bento stream pipeline is configured with a single config file, you can generate a fresh one with:

bento create > config.yaml

The main sections that make up a config are input, pipeline and output. When you generate a fresh config it'll simply pipe stdin to stdout like this:

input:
stdin: {}

pipeline:
processors: []

output:
stdout: {}

Eventually we'll want to configure a more useful input and output, but for now this is useful for quickly testing processors. You can execute this config with:

bento -c ./config.yaml

Anything you write to stdin will get written unchanged to stdout, cool! Resist the temptation to play with this for hours, there's more stuff to try out.

Next, let's add some processing steps in order to mutate messages. The most powerful one is the mapping processor which allows us to perform mappings, let's add a mapping to uppercase our messages:

input:
stdin: {}

pipeline:
processors:
- mapping: root = content().uppercase()

output:
stdout: {}

Now your messages should come out in all caps, how whacky! IT'S LIKE BENTO IS SHOUTING BACK AT YOU!

You can add as many processing steps as you like, and since processors are what make Bento powerful they are worth experimenting with. Let's create a more advanced pipeline that works with JSON documents:

input:
stdin: {}

pipeline:
processors:
- sleep:
duration: 500ms
- mapping: |
root.doc = this
root.first_name = this.names.index(0).uppercase()
root.last_name = this.names.index(-1).hash("sha256").encode("base64")

output:
stdout: {}

First, we sleep for 500 milliseconds just to keep the suspense going. Next, we restructure our input JSON document by nesting it within a field doc, we map the upper-cased first element of names to a new field first_name. Finally, we map the hashed and base64 encoded value of the last element of names to a new field last_name.

Try running that config with some sample documents:

echo '{"id":"1","names":["celine","dion"]}
{"id":"2","names":["chad","robert","kroeger"]}' | bento -c ./config.yaml

You should see (amongst some logs):

{"doc":{"id":"1","names":["celine","dion"]},"first_name":"CELINE","last_name":"1VvPgCW9sityz5XAMGdI2BTA7/44Wb3cANKxqhiCo50="}
{"doc":{"id":"2","names":["chad","robert","kroeger"]},"first_name":"CHAD","last_name":"uXXg5wCKPjpyj/qbivPbD9H9CZ5DH/F0Q1Twytnt2hQ="}

How exciting! I don't know about you but I'm going to need to lie down for a while. Now that you are a Bento expert might I suggest you peruse these sections to see if anything tickles your fancy?