Job Description

Company OverviewFlux is pioneering a new class of AI accelerators called Optical Tensor Processing Units (OTPUs). We’ve already developed functioning prototypes and are now scaling our operations in London. Our work environment rewards innovation, speed, and bold thinking.The roleWe’re hiring Senior and Staff Software Engineers to build the high-performance computing infrastructure that powers our Optical Tensor Processing Units (OTPUs). This isn’t just about scaling models—it’s about rethinking how AI workloads are executed at speed and scale.You’ll lead the design and implementation of software systems that run distributed, low-latency inference across clusters. You’ll work closely with hardware and ML teams to optimise every layer of the stack—from model representation and execution to data movement and scheduling. Whether it’s through compiler techniques, systems-level tuning, or custom runtime design, you’ll play a critical role in shaping the performance layer of our AI platform. This is a role for engineers who think in microseconds, not just model accuracy. If you’ve worked in HFT, large-scale scientific compute, or AI infrastructure at serious scale, we’d love to talk.ResponsibilitiesDesign and build high-performance systems for running AI/ML workloads across distributed compute clustersOptimise for ultra-low latency and real-time inference at scale—profiling, tuning, and rewriting critical systems as neededIdentify and resolve performance bottlenecks across the stack, from model execution and scheduling to hardware-level constraintsCollaborate with compiler engineers to improve code generation, execution paths, and memory layouts using tools like LLVM or MLIRWork with hardware teams to ensure the software stack fully leverages the capabilities of our OTPU architectureExtend ML frameworks (e.g. PyTorch, ONNX, OpenXLA) to better support performance-critical inference pathsLead design reviews, mentor engineers, and promote best practices in HPC and performance engineeringStay on the frontier of new developments in AI infrastructure, compute systems, and compiler toolingSkills & Experience5+ years of experience building performance-critical systems in HPC, HFT, large-scale simulation, or AI infrastructureDeep understanding of distributed systems, with a focus on real-time or near real-time data processingStrong programming skills in C++ and Python, especially for performance-sensitive applicationsHands-on experience with ML compilers (e.g. LLVM, MLIR), and knowledge of runtime and scheduling optimisationsPractical knowledge of ML frameworks like PyTorch, ONNX, or OpenXLA, and how to optimise their executionExperience scaling AI workloads across clusters or custom infrastructure—not just deploying on standard cloud setupsStrong debugging, profiling, and performance-tuning skills across the stackDegree in Computer Science, Engineering, Mathematics, or a related fieldDetailsCompetitive salary ranging from £145k+, depending on experience.Stock options in a rapidly growing AI company.Based in our new 5,000 sq. ft. office in the AI hub of Kings Cross, London.Flux hires candidates within a 45-minute commute of our office—offering an extra £24k/year incentive if you choose to live within 20 minutes. The Comprehensive healthcare insurance.25 days PTO policy plus bank holidays.Private use of our 3D printer.If you’re passionate about compilers, high-performance computing, and redefining what’s possible in AI, we’d love to talk. Apply now to join Flux and help shape the future of optical computing.

Job Title

Company : Flux Computing

Location : London, England

Created : 2025-04-21

Job Type : Full Time