Kafka output plugin for Embulk

Compatibility

embulk-output-kafka	embulk
0.4.x	0.11.x or later
0.3.x	0.9.x or later

Overview

Plugin type: output
Load all or nothing: no
Resume supported: no
Cleanup supported: yes

Configuration

broker: kafka broker host and port (array(string), required)
topic: target topic name (string, required)
topic_column: use column value as target topic (string, default: null)
schema_registry_url: Schema Registy URL that is needed for avro format (string, default: null)
serialize_format: use column value as target topic (enum, required, json or avro_with_schema_registry)
avsc_file: avro schema file path (string, default: null)
avsc: inline avro schema config (json, default: null)
subject_name: subject name for schema_registry (string, default: null)
ignore_columns: remove columns from output (array(string), default: [])
key_column_name: use column value as record key (string, default: null, it can use columns in ignore_columns)
partition_column_name: use column value as partition id (string, default: null, this value is prefer to key_column_name, and if partition_column value is null, use key_column for partitioning)
column_for_deletion: Determine to delete (string, default: null, column_for_deletion column must be boolean. If the value of the column is true, KafkaProducer sends null value to a Kafka Broker.)
record_batch_size: kafka producer record batch size (integer, default: 1000)
acks: kafka producer require acks (string, default: "1")
retries: kafka producer max retry count (integer, default: 1)
other_producer_configs: other producer configs (json, default: {})
value_subject_name_strategy: Set SchemaRegistry subject name strategy (string, default: null, ex. io.confluent.kafka.serializers.subject.RecordNameStrategy)

If use avro_with_schema_registry format, following configs are required.

schema_registry_url

If avsc and avsc_file are null, embulk-output-kafka fetch a schema from schema registry. But currently, embulk-output-kafka supports only TopicNameStrategy. If you want to use another subject name, use subject_name parameter.

Example

in:
  type: file
  path_prefix: ./src/test/resources/in1
  parser:
    charset: UTF-8
    newline: CRLF
    type: csv
    delimiter: ','
    quote: '"'
    escape: '"'
    null_string: 'NULL'
    skip_header_lines: 1
    columns:
    - {name: 'id', type: string}
    - {name: 'int_item', type: long}
    - {name: 'varchar_item', type: string}

out:
  type: kafka
  topic: "json-topic"
  serialize_format: json
  brokers:
    - "localhost:9092"

in:
  type: file
  path_prefix: ./src/test/resources/in_complex
  parser:
    charset: UTF-8
    newline: CRLF
    type: csv
    delimiter: "\t"
    quote: "\0"
    escape: "\0"
    null_string: 'NULL'
    skip_header_lines: 1
    columns:
    - {name: 'id', type: string}
    - {name: 'int_item', type: long}
    - {name: 'time', type: timestamp, format: "%Y-%m-%dT%H:%M:%S"}
    - {name: 'array', type: json}
    - {name: 'data', type: json}

out:
  type: kafka
  topic: "avro-complex-topic"
  acks: all
  retries: 3
  brokers:
    - "localhost:9092"
  schema_registry_url: "http://localhost:48081/"
  serialize_format: avro_with_schema_registry
  other_producer_configs:
    buffer.memory: "67108864"
  avsc:
    type: record
    name: ComplexRecord
    fields: [
      {name: "id", type: "string"},
      {name: "int_item", type: "long"},
      {name: "time", type: "long", logicalType: "timestamp-milli"},
      {name: "array", type: {type: "array", items: "long"}},
      {name: "data", type: {type: "record", name: "InnerData", fields: [
        {name: "hoge", type: "string"},
        {name: "aaa", type: ["null", "string"]},
        {name: "array", type: {type: "array", items: "long"}},
      ]}},
    ]

Build

$ ./gradlew gem  # -t to watch change of files and rebuild continuously

Name		Name	Last commit message	Last commit date
Latest commit History 87 Commits
.github		.github
config/checkstyle		config/checkstyle
examples		examples
gradle		gradle
lib/embulk/output		lib/embulk/output
src		src
.gitignore		.gitignore
LICENSE.txt		LICENSE.txt
README.md		README.md
build.gradle		build.gradle
docker-compose.yml		docker-compose.yml
gradle.lockfile		gradle.lockfile
gradlew		gradlew
gradlew.bat		gradlew.bat
settings.gradle		settings.gradle

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Kafka output plugin for Embulk

Compatibility

Overview

Configuration

Example

Build

About

Releases

Packages

Contributors 2

Languages

License

joker1007/embulk-output-kafka

Folders and files

Latest commit

History

Repository files navigation

Kafka output plugin for Embulk

Compatibility

Overview

Configuration

Example

Build

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages