Backend Engineer - Compute Platform

Company: AION

Job Location: Bengaluru, Karnataka, India

Job Type: Full-time (Hybrid)

Date Posted: April 05, 2025

About AION

AION is building a next-generation AI cloud platform, transforming high-performance computing (HPC) through its decentralized AI cloud. Purpose-built for bare-metal performance, AION democratizes access to compute power for AI training, fine-tuning, inference, data labeling, and beyond.

By leveraging underutilized resources such as idle GPUs and data centers, AION provides a scalable, cost-effective, and sustainable solution tailored for developers, researchers, and enterprises. The platform's innovative Proof of Compute Contribution (PoCC) protocol rewards contributors based on performance, creating a transparent and efficient ecosystem.

Integrated with Tether (USD₮ & USD₮0) for stability and regulatory clarity, AION eliminates volatility, ensuring predictable costs and seamless transactions. With cutting-edge partnerships and a USD-backed economy, AION is pioneering the commoditization of high-performance compute, empowering global innovation and bridging the AI wealth gap.

Led by high-pedigree founders with previous exits, AION is well funded by major VCs and has strategic global partnerships. Headquartered in the US with a global presence, the company is building its initial core team in India.

Who you are

You are a systems-oriented backend developer with deep expertise in designing distributed services and resource management systems. You understand operating system fundamentals and have experience building robust queuing architectures. You're passionate about creating scalable billing and resource allocation systems that can operate reliably under heavy load. You thrive on solving complex problems around service communication, system efficiency, and infrastructure abstraction.

Technical Skills & Experience

  • 5-8 years of experience in backend development (exceptional candidates with different experience profiles will be considered)
  • A Tier 1 college education or previous work experience at FAANG or top startups is preferred but not required
  • Background in compute infrastructure at cloud companies like AWS, GCP, Azure, Rubrik, or similar is highly valuable
  • Service Architecture: Strong experience designing microservices, event-driven architectures, and distributed systems
  • Queuing Systems: Deep knowledge of RabbitMQ, Kafka, SQS or similar platforms for building robust asynchronous processing systems
  • Resource Management: Experience with resource allocation, scheduling, and quota management systems
  • Billing Systems: Expertise in building usage-based billing, metering services, and financial reconciliation
  • Operating Systems: Strong understanding of Linux/Unix environments, process management, and system calls
  • Programming: Proficiency in Python (FastAPI/Flask), Go, or Java with emphasis on concurrent programming
  • Database: Skills in database schema design, transaction management, and high-throughput data access patterns
  • Database Systems: Experience with PostgreSQL, Redis, and time-series databases for operational metrics
  • System Monitoring: Knowledge of metrics collection, resource utilization tracking, and alerting systems
  • Performance: Experience optimizing high-throughput backend services and identifying bottlenecks
  • Infrastructure: Understanding of virtualization, containerization, and cloud resource abstractions
  • Web3/Blockchain: Experience with blockchain interactions for financial settlement (preferred but not required)

Key Responsibilities

  1. Design scalable service architectures for the compute platform and marketplace.
  2. Develop robust queuing and job management systems for handling compute workloads.
  3. Implement comprehensive resource tracking, allocation, and quota enforcement systems.
  4. Build usage-based billing systems with precise metering and reconciliation.
  5. Design and implement cost optimization and resource efficiency monitoring services.
  6. Develop distributed worker systems for processing asynchronous operations at scale.
  7. Design and implement fault-tolerant backend services with automated recovery.
  8. Create provider resource abstraction layers that normalize diverse hardware configurations.
  9. Implement performant database access patterns for high-throughput transaction processing.
  10. Design and develop real-time monitoring systems for resource utilization and availability.
  11. Build system integration services that interact with Linux/Unix environments and container runtimes.
  12. Implement settlement systems that integrate with blockchain components.
  13. Design and build real-time workload backup and migration services for ephemeral infrastructure.

Location

Individuals in this role are expected to relocate to Bangalore, though exceptions can be made. We offer a hybrid working setup with 3 days in the office. Employees have the flexibility to work from anywhere for a few months each year.

Why Join Us

  • Be part of a mission-driven team at the intersection of web3 and AI, tackling some of the most exciting challenges in the industry.
  • Join the ground floor of an AI startup, with the opportunity to make a significant impact on the company and the industry.
  • Collaborate with top-tier talent from the tech industry.
  • Competitive salary and benefits package.
  • Flexible work environment with opportunities for professional growth and development.

If you are a skilled and motivated Backend Engineer with a passion for building secure, high-performance compute infrastructure, we would love to hear from you.

"Don't count the days, make the days count." - Muhammad Ali