Site Reliability Engineer

About Inrupt

Sir Tim Berners-Lee created Solid to realize the web as he fully envisioned it. It's an Open Standard that connects people to their data.

Inrupt provides enterprise-grade Solid software and services. Our products are the expression of decades of experience in security, compliance, and operational excellence.

Inrupt powers innovation for the shared benefit of individuals, developers and organizations. We lead a worldwide movement of inventors, investors, technologists, business leaders and governments who are committed to a web that works for everyone.

Governments and corporations are in the early stages of deployment, and the company is well funded and poised for significant scale.

Responsibilities
  • Manage day-to-day operations of AWS EKS clusters across development, staging, and production environments
  • Monitor system health, triage alerts, and respond to incidents within a 15-minute SLO
  • Perform regular patching, upgrades, and maintenance of infrastructure components
  • Maintain and optimize our technology stack: EKS, MSK, RDS, ArgoCD, Traefik, Sysdig, Mezmo, Terraform
  • Manage AWS services, including VPC, RDS, MSK (Kafka), S3, and networking infrastructure
  • Implement and maintain comprehensive monitoring dashboards, alerting, and centralized logging
  • Maintain Terraform-based infrastructure automation and practice GitOps principles
  • Manage data infrastructure lifecycle: RDS databases, Kafka clusters, Redis caching, S3 buckets
  • Implement security baselines, manage RBAC, and conduct vulnerability scanning and remediation
  • Design and test disaster recovery strategies with defined RTO/RPO
  • Support ArgoCD deployments and troubleshoot application deployment issues
  • Create and maintain documentation and troubleshooting guides
  • Provide architectural reviews and capacity planning aligned with business objectives
  • Optimize infrastructure costs while maintaining performance and reliability
  • Establish on-call rotation and incident response procedures with post-mortem analysis
  • Work closely with engineers to ensure operational requirements are built into our products
  • Work closely with engineers to ensure that non-functional requirements are met by the proposed architecture, design, and development choices

About You
  • Experience managing production Kubernetes clusters, preferably AWS EKS
  • Deep knowledge of cloud platform services (e.g., EC2, EKS, VPC, RDS, S3, IAM, CloudWatch)
  • Strong Terraform experience for infrastructure automation
  • Experience with monitoring platforms (Sysdig, Datadog, or similar) and logging systems
  • Hands-on experience with ArgoCD or similar tools
  • Strong understanding of networking: VPCs, security groups, load balancers, DNS
  • Database administration experience (PostgreSQL), including backups and performance tuning
  • Experience with message queue systems (Kafka/MSK preferred)
  • Proficiency in Python, Bash, or Go for automation
  • Excellent communication skills with the ability to explain complex technical concepts clearly
  • Ownership mindset with strong problem-solving and analytical skills
  • Experience with security best practices and compliance frameworks (SOC2, GDPR)

Bonus
  • Service mesh experience (Istio, Linkerd, Consul)
  • FinOps practices and cost optimization experience
  • Chaos engineering and resilience testing
  • Multi-region infrastructure experience
  • AWS certifications (Solutions Architect, DevOps Engineer, or Security)
  • CKA (Certified Kubernetes Administrator) certification
  • Experience supporting government or highly regulated industries

How we will support you

We strive to empower our team members to be self-directed and self-motivated in their work.

  • Remote First: We've always been a fully distributed company with team members all over the world.
  • Commitment to Personal Growth: Every team member has an annual budget to invest in their professional development, including an annual conference budget.
  • Work/Life Balance: Flexible working hours and unlimited paid time off. We want you to thrive both in and out of the office. We trust you to use good judgment and take the time off that you need to bring your best self to work.
  • Social Events: As a fully remote company, it’s important that we get some time together to socialize and get to know one another outside of our day-to-day projects and meetings. We organize quarterly online social events, e.g., remote cooking classes and quizzes.
  • Work Anniversary Gifts
  • $800 Office Set-Up Allowance

If you think you might thrive in this environment, we would love to hear from you.

Diversity, Equity, and Inclusion

Inrupt provides equal work opportunities to all team members and applicants, and it prohibits discrimination and harassment of any type on the basis of race, color, ethnicity, caste, religion, age, sex (including pregnancy), national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by our policies or federal, state, or local laws.

We want to ensure that our hiring process is accessible. If you need reasonable accommodation for any part of the application process because of a medical condition or disability, please send an email to jobs@inrupt.com to let us know the nature of your request.

Additional Considerations
  • Sometimes we meet up! Expect some travel: once a year for our all-hands meetup and occasional team meetings throughout the year, usually in London.
  • The successful candidate will be subject to a background check, and satisfactory results are a condition of joining the team.
  • By applying for this role, you confirm that all information submitted is accurate and complete. You further acknowledge that providing false or fraudulent information during the application process is cause for denial of an offer, revocation of any existing offer, or other adverse action, up to and including termination after you start work.