Inferact's mission is to grow vLLM as the world's AI inference engine and accelerate AI progress by making inference cheaper and faster. Founded by the creators and core maintainers of vLLM, we sit at the intersection of models and hardware—a position that took years to build.

Member of Technical Staff, Inference

Member of Technical Staff, Developer Relations

Member of Technical Staff, Cluster Administration

Member of Technical Staff, TPU & AMD GPU Performance Engineering

Member of Technical Staff, Exceptional Generalist (Remote)

Member of Technical Staff, Cloud Orchestration

Member of Technical Staff, Kernel Engineering















