CAIMAN-ASR enables at-scale ASR, supporting over 1000 real-time streams within stringent latency budgets, reducing CapEx costs by as much as 90%. A single 1U server with one accelerator card running CAIMAN-ASR has the same throughput capacity as twenty unaccelerated servers.
CAIMAN-ASR leverages the parallel processing advantages of Achronix’s Speedster7t® FPGA, the power behind the accelerator cards, to achieve extremely low latency inference. This enables a raft of NLP workloads to be performed in a human-like response time for end-to-end conversational AI.
The complete accelerator stack provided includes all the software required for the WebSocket and a WebSocket API to simplify the interface to existing service provisions.
CAIMAN-ASR runs on industry-standard PCIe accelerator cards, enabling existing racks to be upgraded quickly for up to 20x greater call capacity. The VectorPath® S7t-VG6 Accelerator Card from BittWare is available off-the-shelf today.
CAIMAN-ASR uses as much as 90% less energy to process the same number of real-time streams as an unaccelerated solution, significantly reducing energy costs and enhancing ESG credentials.
CAIMAN-ASR is provided pre-trained for high quality English language transcription. For applications requiring specialist vocabularies or alternative languages, the neural model can easily be retrained with customers’ own large, bespoke datasets using the popular ML framework PyTorch.