Skip to main content

protobuf

Performs conversions to or from a protobuf message. This processor uses reflection, meaning conversions can be made directly from the target .proto files.

# Config fields, showing default values
label: ""
protobuf:
operator: "" # No default (required)
message: "" # No default (required)
discard_unknown: false
use_proto_names: false
import_paths: []

The main functionality of this processor is to map to and from JSON documents, you can read more about JSON mapping of protobuf messages here: https://developers.google.com/protocol-buffers/docs/proto3#json

Using reflection for processing protobuf messages in this way is less performant than generating and using native code. Therefore when performance is critical it is recommended that you use Bento plugins instead for processing protobuf messages natively, you can find an example of Bento plugins at https://github.com/warpstreamlabs/bento-plugin-example

Operators

to_json

Converts protobuf messages into a generic JSON structure. This makes it easier to manipulate the contents of the document within Bento.

from_json

Attempts to create a target protobuf message from a generic JSON structure.

Examples

If we have the following protobuf definition within a directory called testing/schema:

syntax = "proto3";
package testing;

import "google/protobuf/timestamp.proto";

message Person {
string first_name = 1;
string last_name = 2;
string full_name = 3;
int32 age = 4;
int32 id = 5; // Unique ID number for this person.
string email = 6;

google.protobuf.Timestamp last_updated = 7;
}

And a stream of JSON documents of the form:

{
"firstName": "caleb",
"lastName": "quaye",
"email": "caleb@myspace.com"
}

We can convert the documents into protobuf messages with the following config:

pipeline:
processors:
- protobuf:
operator: from_json
message: testing.Person
import_paths: [ testing/schema ]

Fields

operator

The operator to execute

Type: string
Options: to_json, from_json.

message

The fully qualified name of the protobuf message to convert to/from.

Type: string

discard_unknown

If true, the from_json operator discards fields that are unknown to the schema.

Type: bool
Default: false

use_proto_names

If true, the to_json operator deserializes fields exactly as named in schema file.

Type: bool
Default: false

import_paths

A list of directories containing .proto files, including all definitions required for parsing the target message. If left empty the current directory is used. Each directory listed will be walked with all found .proto files imported.

Type: array
Default: []