KTensorFlow
KTensorFlow is a Kotlin Multiplatform library designed to run LiteRT (TensorFlow Lite) neural network models from common code. It abstracts platform-specific implementation details, making it easier to load models and run inference across Android and iOS.
Installation
First add dependencies:
dependencies {
// core module, contains Interpreter and other core classes and functions
implementation("dev.kursor.ktensorflow:ktensorflow-core:1.2")
// tensors module, adds support for Tensors and allows easy data transformation
// highly recommended if you don't want to convert your model inputs and outputs to and from ByteArray manually
implementation("dev.kursor.ktensorflow:ktensorflow-tensor:1.2")
// gpu module, contains delegate to run inference on the gpu
implementation("dev.kursor.ktensorflow:ktensorflow-gpu:1.2")
// npu module, contains delegate to run inference on the npu
implementation("dev.kursor.ktensorflow:ktensorflow-npu:1.2")
// pipeline module, contains utils to create pipelines for preprocessing and postprocessing of the data
implementation("dev.kursor.ktensorflow:ktensorflow-pipeline:1.2")
// moko module, contains extensions to load models from moko-resources (ModelDesc.FileResource and ModelDesc.AssetResource)
implementation("dev.kursor.ktensorflow:ktensorflow-moko:1.2")
// compose module, contains extension to load models from compose-resources (ModelDesc.ComposeUri)
implementation("dev.kursor.ktensorflow:ktensorflow-compose:1.2")
}
To link the TensorFlow Lite binaries on iOS, you need to add the linking plugin:
plugins {
id("dev.kursor.ktensorflow.link") version "1.2"
}
Currently, the library only supports projects that are linked to the iOS app via CocoaPods.
Usage
Load the model
First, create a ModelDesc that provides the model to the library.
By default, a ModelDesc must be created in platform-specific code, since Android and iOS load models differently.
However, the ktensorflow-moko and ktensorflow-compose modules provide extensions for Moko and Compose resources.
Android examples with Compose Multiplatform Resources:
- ModelDesc.File
val bytes = Res.readBytes("files/model.tflite")
val tmpFile = File.createTempFile("prefix", "suffix", context.cacheDir)
tmpFile.writeBytes(bytes)
val modelDesc = ModelDesc.File(tmpFile)
- ModelDesc.ByteBuffer
val bytes = Res.readBytes("files/model.tflite")
val byteBuffer = ByteBuffer.wrap(bytes).apply { order(ByteOrder.nativeOrder()) }
val modelDesc = ModelDesc.ByteBuffer(byteBuffer)
iOS example with Compose Multiplatform Resources:
- ModelDesc.PathInBundle
val modelDesc = ModelDesc.PathInBundle(Res.getUri("files/model.tflite").removePrefix("file://"))
Moko resources extensions
The ktensorflow-moko module contains extension functions to create a ModelDesc from moko-resources.
It adds two functions:
- ModelDesc.FileResource(resource: FileResource) to load a model from moko's FileResource
- ModelDesc.AssetResource(resource: AssetResource) to load a model from moko's AssetResource
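For example, loading a model bundled as a moko FileResource could look like the sketch below; the MR.files.model accessor is a placeholder for your own generated resource name:

```kotlin
// MR.files.model is a hypothetical moko-resources accessor;
// substitute the one generated for your project.
val modelDesc = ModelDesc.FileResource(MR.files.model)

val interpreter = Interpreter(
    modelDesc = modelDesc,
    options = InterpreterOptions()
)
```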
Compose resources extensions
The ktensorflow-compose module contains an extension function to create a ModelDesc from compose-resources:
- ModelDesc.ComposeUri(uri: String) to load a model from a URI obtained with Res.getUri(<filePath>)
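With this extension, the per-platform examples above collapse into one common-code call (a sketch, assuming the model file sits under composeResources/files):

```kotlin
// Works from common code: ModelDesc.ComposeUri resolves the
// compose-resources URI on each platform.
val modelDesc = ModelDesc.ComposeUri(Res.getUri("files/model.tflite"))
```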
Run inference
To run inference, create an Interpreter:
val interpreter = Interpreter(
modelDesc = modelDesc,
options = InterpreterOptions()
)
Then run the model:
val inputArray = Array(28) {
FloatArray(28) {
Random.nextFloat()
}
}
val input = Tensor<Float>(inputArray)
val output = Tensor<Float>(
shape = TensorShape(10),
dataType = TensorDataType.Float32
)
interpreter.run(input, output)
val result = output.argmax()[0]
Note
By default, Interpreter.run accepts only ByteArray for inputs and outputs. The Tensor class is shipped as a separate module (ktensorflow-tensor) and is highly recommended for easier data manipulation.
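If you use the core module alone, inputs and outputs have to be packed into and out of ByteArray by hand. A minimal sketch of that packing for Float32 data, using only plain Kotlin and java.nio (so it applies to the Android/JVM side; common code would need an expect/actual equivalent):

```kotlin
import java.nio.ByteBuffer
import java.nio.ByteOrder

// Pack a FloatArray into a ByteArray in native byte order,
// matching the layout a Float32 input tensor expects.
fun FloatArray.toByteArray(): ByteArray {
    val buffer = ByteBuffer.allocate(size * Float.SIZE_BYTES)
        .order(ByteOrder.nativeOrder())
    forEach { buffer.putFloat(it) }
    return buffer.array()
}

// Unpack a ByteArray produced by the model back into floats.
fun ByteArray.toFloatArray(): FloatArray {
    val buffer = ByteBuffer.wrap(this).order(ByteOrder.nativeOrder())
    return FloatArray(size / Float.SIZE_BYTES) { buffer.getFloat() }
}
```

The same round trip works for Int32, UInt8 and Int64 data with the corresponding ByteBuffer put/get calls.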
Data transformation with Tensors
Tensors support arithmetic and transformation operations, as well as forEach, sum, min, max, argmin and argmax functions.
You can create Tensors from multidimensional primitive arrays with:
- Tensor<T>(any: Any) - where T is Float, Int, UByte or Long, and any: Any is an N-dimensional primitive array of one of these types, for example Array<Array<FloatArray>>
And you can transform Tensors to primitive arrays with:
- Tensor<T>.toArray<R>() - returns a primitive multidimensional array of type R with the data from the Tensor
Example:
val tensor1: Tensor<Float> = Tensor(Array(28) { FloatArray(28) { Random.nextFloat() } })
val tensor2: Tensor<Float> = Tensor(Array(28) { FloatArray(28) { Random.nextFloat() } })
val tensor3: Tensor<Float> = tensor1 + tensor2
val array = tensor3.toArray<Array<FloatArray>>()
val argmax: IntArray = tensor3.argmax()
Tensors support these data types, which translate to the corresponding Kotlin types:
- TensorDataType.Float32 -> Float
- TensorDataType.Int32 -> Int
- TensorDataType.UInt8 -> UByte
- TensorDataType.Int64 -> Long
Hardware acceleration
Hardware acceleration is provided by delegates.
Built-in delegates to run inference on the GPU and NPU are available in the ktensorflow-gpu and ktensorflow-npu modules.
Delegates are provided to the interpreter via InterpreterOptions:
val options = InterpreterOptions(
numThreads = 4,
useXNNPack = true,
delegates = listOf(GpuDelegate())
)
Delegates are provided to the Interpreter as a list of candidates; only the first delegate that is available on the device is used.
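For example, to try the NPU first and fall back to the GPU (and to the CPU if neither is available); the no-argument NpuDelegate() constructor here is an assumption, by analogy with GpuDelegate():

```kotlin
val interpreter = Interpreter(
    modelDesc = modelDesc,
    options = InterpreterOptions(
        numThreads = 4,
        // Tried in order: NPU first, then GPU; the CPU is the implicit
        // fallback when no delegate in the list is available.
        delegates = listOf(NpuDelegate(), GpuDelegate())
    )
)
```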
Providing platform-specific options
If you need to provide platform-specific options to the Interpreter or GpuDelegate, use the platform-specific builder functions:
Android:
val interpreterOptions = InterpreterOptions { // this: Interpreter.Options
setUseNNAPI(true)
}
val gpuDelegateOptions = GpuDelegateOptions { // this: GpuDelegateFactory.Options
setPrecisionLossAllowed(true)
}
val npuDelegateOptions = NpuDelegateOptions { // this: NpuDelegate.Options
setMaxNumberOfDelegatedPartitions(maxDelegatedPartitions)
}
iOS:
val interpreterOptions = InterpreterOptions(delegates = emptyList()) { // this: TFLInterpreterOptions
setUseXNNPACK(true)
}
val gpuDelegateOptions = GpuDelegateOptions { // this: TFLMetalDelegateOptions
setWaitType(TFLMetalDelegateThreadWaitType.TFLMetalDelegateThreadWaitTypeActive)
}
val npuDelegateOptions = NpuDelegateOptions { // this: TFLCoreMLDelegateOptions
setMaxDelegatedPartitions(maxDelegatedPartitions.toULong())
}
Writing custom delegates
If you need a custom delegate that is not yet supported by the library, create a class that implements the Delegate interface.
Preprocessing and postprocessing of the data
You can create pipelines to simplify preprocessing and postprocessing of the data.
Two kinds of pipelines are available: single input/output and multiple input/output.
Single i/o pipeline creation
val pipeline = Pipeline.linear<Array<UByteArray>>()
.floatify()
.normalize()
.tensorize()
.inference(
interpreter = interpreter,
index = 0,
dataType = TensorDataType.Float32,
shape = TensorShape(10)
)
.argmax()
.classify(listOf("0", "1", "2", "3", "4", "5", "6", "7", "8", "9"))
Multiple i/o pipeline creation
val pipeline = Pipeline
.input(
preprocessing = Stage<Array<UByteArray>>()
.floatify()
.normalize()
.tensorize()
)
.inference(interpreter)
.output(
index = 0,
dataType = TensorDataType.Float32,
shape = TensorShape(10),
preprocessing = Stage<Tensor>()
.argmax()
.classify(listOf("0", "1", "2", "3", "4", "5", "6", "7", "8", "9"))
)
.build()
Then you can call
val output = pipeline.run(input)
Library development plan
- Add utils for media and text processing
If you have any other suggestions, feel free to create an issue, I'd love to hear your thoughts!
License
Copyright 2025 Sergey Kurochkin
Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.