Deployment Overview
We are seeking a senior infrastructure engineer to optimize and scale our core LLM deployment pipelines. You will design, build, and maintain low-latency vector index caches, secure pipeline checkpoints, and distributed cluster weights.
In this role, you will collaborate closely with AI research teams and data pipeline coordinators to guarantee 99.999% uptime of our cognitive inference gateways under macro scale queries.