The requirements describe AIMax as a broad inference runtime rather than a text-only engine.
Supported modalities
Text generation, vision encoding, speech-to-text, text-to-speech, text-to-image, and text-to-video workflows are all part of the intended scope.
Hardware targets
CPU backends on x86-64 with AVX2 and AVX-512 plus ARM with NEON, NVIDIA GPUs with CUDA, Apple GPUs with Metal, and cross-vendor GPUs with Vulkan.
CPU SIMD path
Runtime CPU feature detection with AVX2/AVX-512 awareness and AVX2 dequant paths for Q8_0, Q4_K, and Q6_K, designed for byte-exact parity with scalar output.