
Description

  • A new runtime that will replace the existing TensorFlow runtime.
  • Responsible for efficient execution of kernels – low-level, device-specific primitives – on targeted hardware (see the sketch below).

Figure: TFRT’s role in graph and eager execution within the TensorFlow training stack.
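
A minimal, self-contained sketch of this idea, assuming nothing about TFRT's real API: a runtime keeps a registry of named kernels and dispatches to a device-specific implementation at execution time. The registry type and the kernel name cpu.add.f32 are illustrative only.

```cpp
// Conceptual sketch only -- not TFRT's real API. It illustrates a runtime
// dispatching named, device-specific kernels at execution time.
#include <cstdio>
#include <functional>
#include <map>
#include <string>
#include <vector>

// A "kernel" here is just a callable over float buffers.
using Kernel = std::function<void(const std::vector<float>&,
                                  const std::vector<float>&,
                                  std::vector<float>&)>;

int main() {
  std::map<std::string, Kernel> registry;

  // Register a low-level primitive under a name, as a runtime might.
  registry["cpu.add.f32"] = [](const std::vector<float>& a,
                               const std::vector<float>& b,
                               std::vector<float>& out) {
    out.resize(a.size());
    for (size_t i = 0; i < a.size(); ++i) out[i] = a[i] + b[i];
  };

  // The executor looks a kernel up by name and invokes it.
  std::vector<float> x{1, 2}, y{3, 4}, z;
  registry.at("cpu.add.f32")(x, y, z);
  std::printf("%g %g\n", z[0], z[1]);  // prints: 4 6
  return 0;
}
```

In the actual tensorflow/runtime repository, kernels are C++ functions registered with the runtime; its tutorial walks through defining and registering real ones.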

Why?

  • Efficient use of multithreaded host CPUs
  • Support for fully asynchronous programming models (see the sketch after this list)
  • Low-level efficiency
  • A modular way for hardware makers to integrate edge and datacenter devices into TensorFlow
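
To make the asynchrony point concrete, here is a minimal sketch with std::async standing in for an asynchronous executor (TFRT's own mechanism is its AsyncValue abstraction, not std::future): independent kernels can run concurrently on host threads, and a dependent kernel waits only on the values it actually consumes.

```cpp
// Conceptual sketch only -- std::async stands in for an asynchronous executor.
#include <cstdio>
#include <future>

int main() {
  // Two independent "kernels" may run in parallel on a multithreaded host CPU.
  std::future<int> a = std::async(std::launch::async, [] { return 1 + 2; });
  std::future<int> b = std::async(std::launch::async, [] { return 3 * 4; });

  // A dependent "kernel" blocks only when its inputs are actually needed.
  std::future<int> c = std::async(std::launch::async,
                                  [&a, &b] { return a.get() + b.get(); });

  std::printf("%d\n", c.get());  // prints: 15
  return 0;
}
```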

TFRT and MLIR

  • TFRT uses MLIR’s compiler infrastructure to generate an optimized, target-specific representation of the computational graph, which the runtime then executes.
  • TFRT uses MLIR’s extensible type system to support arbitrary C++ types in the runtime, removing tensor-specific limitations (see the sketch below).
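
One way a runtime can carry arbitrary C++ types is type erasure. A minimal sketch of that idea, with std::any standing in for TFRT's actual AsyncValue abstraction (this is not TFRT's API): runtime value slots that can hold arbitrary C++ payloads rather than tensors alone.

```cpp
// Conceptual sketch only -- std::any stands in for a type-erased runtime value.
#include <any>
#include <cstdio>
#include <string>
#include <vector>

int main() {
  // Runtime value slots that are not restricted to tensor payloads.
  std::vector<std::any> values;
  values.emplace_back(42);                         // an integer
  values.emplace_back(std::string("checkpoint"));  // a string
  values.emplace_back(std::vector<float>{1, 2});   // a tensor-like buffer

  std::printf("%d %s %zu\n",
              std::any_cast<int>(values[0]),
              std::any_cast<const std::string&>(values[1]).c_str(),
              std::any_cast<const std::vector<float>&>(values[2]).size());
  return 0;
}
```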

TFRT CPU kernels

  • Where do the TFRT CPU kernels live?
  • What do they build on?
  • Do they use Eigen or oneDNN?

FAQ

  • What is TFRT?
  • When is TFRT going to be the default runtime?
  • What are the main problems being solved by TFRT?
  • What are the main use cases for TFRT?
  • What does TFRT rely on for CPU kernels?
  • What is the relation between TFRT and MLIR?
  • What is the relation between TFRT and TensorRT?
  • What is the relation between TFRT and oneDNN?
  • What is the relation between TFRT and TensorFlow Lite?

See also
