Talent.com
Cette offre d'emploi n'est pas disponible dans votre pays.
Site Reliability Engineer M / F / X

Site Reliability Engineer M / F / X

TrustpairParis, Île-de-France, France
Il y a plus de 30 jours
Type de contrat
  • Temps plein
Description de poste

Trustpair empowers large global companies to eliminate vendor payment fraud with a market leading account validation automation platform. Trustpair serves over 400 enterprise customers helping finance teams protect against 100% of fraud attacks.

The companys global presence includes offices in New York City Paris and Milan. Our team is composed of 100 employees with 15 different nationalities who are dedicated to payment security. Trustpair raised 20 million euros to accelerate international growth and equip finance leaders with the tools needed to tackle sophisticated fraud tactics such as AI deepfakes cyber attacks and more.

About the role

We are excited to open a new position for a Site Reliability Engineer to join and strengthen our Engineering team. You will work closely with Benjamin our current SRE to ensure the reliability performance and scalability of our infrastructure which supports critical financial services for our clients.

Our platform runs entirely on AWS and Kubernetes managed with Infrastructure as Code (Terraform). Datadog is at the core of our observability stack enabling us to monitor detect and respond to issues quickly to maintain high levels of reliability and performance.

You will help drive operational excellence optimize infrastructure costs and continuously enhance the developer experience through improved CI / CD practices automation and observability. While the core of your mission is infrastructure-focused you will also support and improve our security and compliance efforts (SOC 2 ISO 27001) ensuring our platform remains trustworthy and secure.

This role is open to on-site or full remote for candidates based in France.

What youll do

Infrastructure Reliability & Scalability

  • Manage and evolve our AWS infrastructure and Kubernetes clusters to ensure high availability robust performance and cost efficiency.
  • Support the deployment and operation of AI workloads and models adapting infrastructure and automation to meet their specific requirements.
  • Leverage Terraform and modern DevOps practices to automate and streamline infrastructure deployments and configurations.
  • Continuously improve infrastructure testing methodologies proactively identifying and resolving potential performance bottlenecks or scalability issues.

Observability and Incident Management

  • Enhance our Datadog monitoring platform to proactively detect alert and address issues prioritizing symptom-based alerting to avoid service disruptions.
  • Lead swift troubleshooting efforts reducing Mean Time To Detection (MTTD) and Mean Time To Resolution (MTTR) for production incidents.
  • Implement effective logging tracing and metrics collection to facilitate rapid issue diagnosis and resolution.
  • Security and Compliance

  • Support our ongoing compliance with SOC 2 and ISO 27001 standards integrating security best practices into daily operations and infrastructure.
  • Manage and leverage security tools (AWS Security Hub GuardDuty Datadog SIEM) to identify vulnerabilities improve incident response and maintain a secure environment.
  • Participate in security assessments and audits proposing actionable improvements.
  • Developer Experience & Empowerment

  • Refine and enhance CI / CD pipelines enabling engineering teams to deploy confidently quickly and securely.
  • Provide tooling automation and clear documentation to empower developers improving overall engineering productivity and satisfaction.
  • Maintain and optimize developer staging and sandbox environments for frictionless development workflows.
  • Whats in it for you

  • A collaborative environment with a flat structure where everyones voice is heard
  • Trustpair is in scaling phase with career opportunities in France and internationally
  • Flexible remote policy
  • A talented team with senior engineers you can learn and work with
  • Inclusive environment with cultural diversity and parity
  • Why join Trustpair A list of our perks here!

    Must-Haves :

  • At least 4 years experience in SRE / DevOps / System Engineering
  • Proven track record with AWS Cloud services Kubernetes & Terraform
  • Experience in SaaS solution deployment
  • Experience with high-scalability architecture
  • You know your way around Linux and the Unix Shell
  • Hands on experience with containerized architecture
  • Experience in infrastructure monitoring solution deployment (New Relic Grafana or equivalent)
  • You are fluent in English
  • Strong problem solving and analytical skills ; you can dig into problems troubleshoot and provide solutions
  • Recruitment Process

  • First call with AichaTalent Acquisition (30 mins)
  • Experience interview with Simon CTO & Cofounder (1h)
  • Case study with Simon & Benjamin SRE (1h)
  • Coffee fit with two members of our team (30 mins)
  • Cofounders interview with Alexandre(30 mins)
  • Equal Opportunity Statement

    Trustpairs policy is to provide equal employment opportunity in all of our employment practices without regard to race color religion sex national origin ancestry marital status protected veteran status age individuals with disabilities sexual orientation or gender identity or expression or any other legally protected category.

    Applicants for all positions in Trustpair must be legally authorized to work in the country which they are applying for or be a citizen from Schengen / EU zone. The verification of employment eligibility will be required as a condition of hire.

    Key Skills

    Kubernetes,FMEA,Continuous Improvement,Elasticsearch,Go,Root cause Analysis,Maximo,CMMS,Maintenance,Mechanical Engineering,Manufacturing,Troubleshooting

    Employment Type : Full-Time

    Experience : years

    Vacancy : 1

    Créer une alerte emploi pour cette recherche

    Site Reliability Engineer • Paris, Île-de-France, France