Kafka
Apache Kafka - A distributed event streaming platform
Install / Use
/learn @apache/KafkaREADME
Apache Kafka is an open-source distributed event streaming platform used by thousands of companies for high-performance data pipelines, streaming analytics, data integration, and mission-critical applications.
You need to have Java installed.
We build and test Apache Kafka with Java versions 17 and 25. The release parameter in javac is set to 11 for the clients
and streams modules, and 17 for the rest, ensuring compatibility with their respective
minimum Java versions. Similarly, the release parameter in scalac is set to 11 for the streams modules and 17
for the rest.
Scala 2.13 is the only supported version in Apache Kafka.
Build a JAR and run it
./gradlew jar
Follow instructions in https://kafka.apache.org/quickstart
Build source JAR
./gradlew srcJar
Build aggregated javadoc
./gradlew aggregatedJavadoc --no-parallel
Build javadoc and scaladoc
./gradlew javadoc
./gradlew javadocJar # builds a javadoc jar for each module
./gradlew scaladoc
./gradlew scaladocJar # builds a scaladoc jar for each module
./gradlew docsJar # builds both (if applicable) javadoc and scaladoc jars for each module
Run unit/integration tests
./gradlew test # runs both unit and integration tests
./gradlew unitTest
./gradlew integrationTest
./gradlew test -Pkafka.test.run.flaky=true # runs tests that are marked as flaky
Force re-running tests without code change
./gradlew test --rerun-tasks
./gradlew unitTest --rerun-tasks
./gradlew integrationTest --rerun-tasks
Running a particular unit/integration test
./gradlew clients:test --tests RequestResponseTest
./gradlew streams:integration-tests:test --tests RestoreIntegrationTest
Running a particular unit/integration test N times
N=500; I=0; while [ $I -lt $N ] && ./gradlew clients:test --tests RequestResponseTest --rerun --fail-fast; do (( I=$I+1 )); echo "Completed run: $I"; sleep 1; done
Running a particular test method within a unit/integration test
./gradlew core:test --tests kafka.api.ProducerFailureHandlingTest.testCannotSendToInternalTopic
./gradlew clients:test --tests org.apache.kafka.clients.MetadataTest.testTimeToNextUpdate
./gradlew streams:integration-tests:test --tests org.apache.kafka.streams.integration.RestoreIntegrationTest.shouldRestoreNullRecord
Running a particular unit/integration test with log4j output
By default, there will be only a small number of logs output while testing. You can adjust it by changing the log4j2.yaml file in the module's src/test/resources directory.
For example, if you want to see more logs for clients project tests, you can modify the line in clients/src/test/resources/log4j2.yaml
to level: INFO and then run:
./gradlew cleanTest clients:test --tests NetworkClientTest
And you should see INFO level logs in the file under the clients/build/test-results/test directory.
Specifying test retries
Retries are disabled by default, but you can set maxTestRetryFailures and maxTestRetries to enable retries.
The following example declares -PmaxTestRetries=1 and -PmaxTestRetryFailures=3 to enable a failed test to be retried once, with a total retry limit of 3.
./gradlew test -PmaxTestRetries=1 -PmaxTestRetryFailures=3
See Test Retry Gradle Plugin and build.yml for more details.
Generating test coverage reports
Generate coverage reports for the whole project:
./gradlew reportCoverage -PenableTestCoverage=true -Dorg.gradle.parallel=false
Generate coverage for a single module, i.e.:
./gradlew clients:reportCoverage -PenableTestCoverage=true -Dorg.gradle.parallel=false
Coverage reports are located within the module's build directory, categorized by module type:
Core Module (:core): core/build/reports/scoverageTest/index.html
Other Modules: <module>/build/reports/jacoco/test/html/index.html
Building a binary release gzipped tarball
./gradlew clean releaseTarGz
The release file can be found inside ./core/build/distributions/.
Building auto-generated messages
Sometimes it is only necessary to rebuild the RPC auto-generated message data when switching between branches, as they could fail due to code changes. You can just run:
./gradlew processMessages processTestMessages
See Apache Kafka Message Definitions for details on Apache Kafka message protocol.
Running a Kafka broker
Using compiled files:
KAFKA_CLUSTER_ID="$(./bin/kafka-storage.sh random-uuid)"
./bin/kafka-storage.sh format --standalone -t $KAFKA_CLUSTER_ID -c config/server.properties
./bin/kafka-server-start.sh config/server.properties
Using docker image:
docker run -p 9092:9092 apache/kafka:latest
See docker/README.md for detailed information.
Cleaning the build
./gradlew clean
Running a task for a specific project
This is for core, examples and clients
./gradlew core:jar
./gradlew core:test
Streams has multiple sub-projects, but you can run all the tests:
./gradlew :streams:testAll
Listing all gradle tasks
./gradlew tasks
Building IDE project
Note: Please ensure that JDK 17 is used when developing Kafka.
IntelliJ supports Gradle natively and it will automatically check Java syntax and compatibility for each module, even if
the Java version shown in the Structure > Project Settings > Modules may not be the correct one.
When it comes to Eclipse, run:
./gradlew eclipse
The eclipse task has been configured to use ${project_dir}/build_eclipse as Eclipse's build directory. Eclipse's default
build directory (${project_dir}/bin) clashes with Kafka's scripts directory and we don't use Gradle's build directory
to avoid known issues with this configuration.
Publishing the streams quickstart archetype artifact to maven
For the Streams archetype project, one cannot use gradle to upload to maven; instead the mvn deploy command needs to be called at the quickstart folder:
cd streams/quickstart
mvn deploy
Please note for this to work you should create/update user maven settings (typically, ${USER_HOME}/.m2/settings.xml) to assign the following variables
<settings xmlns="http://maven.apache.org/SETTINGS/1.0.0"
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:schemaLocation="http://maven.apache.org/SETTINGS/1.0.0
https://maven.apache.org/xsd/settings-1.0.0.xsd">
...
<servers>
...
<server>
<id>apache.snapshots.https</id>
<username>${maven_username}</username>
<password>${maven_password}</password>
</server>
<server>
<id>apache.releases.https</id>
<username>${maven_username}</username>
<password>${maven_password}</password>
</server>
...
</servers>
...
Installing all projects to the local Maven repository
./gradlew -PskipSigning=true publishToMavenLocal
Installing specific projects to the local Maven repository
./gradlew -PskipSigning=true :streams:publishToMavenLocal
Building the test JAR
./gradlew testJar
Running code quality checks
There are two code quality analysis tools that we regularly run, SpotBugs and Checkstyle.
Checkstyle
Checkstyle enforces a consistent coding style in Kafka. You can run Checkstyle using:
./gradlew checkstyleMain checkstyleTest spotlessCheck
The Checkstyle warnings will be found in reports/checkstyle/reports/main.html and reports/checkstyle/reports/test.html files in the
subproject build directories. They are also printed to the console. The build will fail if Checkstyle fails.
For experiments (or regression testing purposes) add -PcheckstyleVersion=X.y.z switch (to override project-defined checkstyle version).
Spotless
The import order is a part of static check. Please call spotlessApply to optimize Java imports before filing a pull request.
./gradlew spotlessApply
SpotBugs
SpotBugs uses static analysis to look for bugs in the code. You can run SpotBugs using:
./gradlew spotbugsMain spotbugsTest -x test
The SpotBugs warnings will be found in reports/spotbugs/main.html and reports/spotbugs/test.html files in the subproject build
directories. Use -PxmlSpotBugsReport=true to generate an XML report instead of an HTML one.
JMH microbenchmarks
We use JMH to write microbenchmarks that produce reliable results in the JVM.
See jmh-benchmarks/README.md for details on how to run the microbenchmarks.
Dep
Related Skills
node-connect
337.3kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
83.2kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
337.3kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
commit-push-pr
83.2kCommit, push, and open a PR
