Profitero logo

Senior Site Reliability Engineer

Profitero
23 days ago
On-site

About Profitero+


Profitero+ is the leading digital commerce company, trusted by more than 4,000 brands worldwide. We help brands break down silos and turn data into decisive action through intelligence-driven, end-to-end solutions that unify media, content, operations and strategy. Powered by advanced AI, robust digital shelf analytics across 1,400+ retailers in 70 countries and unmatched expertise from digital commerce specialists in 15 global hubs, our integrated solutions help brands accelerate profitable growth.

Come be a part of our fast-paced, entrepreneurial culture and next stage of growth.


Location: Remote role based in Poland


About the team:


You will be part of the Infrastructure/Platform function responsible for reliability, availability, performance, and operational readiness across our cloud environments. Our teams are distributed internationally, with key members located in Eastern Europe.


Key responsibilities:


  • Improve reliability and availability of critical services (production focus);
  • Participate in incident response and post-incident reviews (RCA), and drive corrective/preventive actions;
  • Build and maintain monitoring/alerting and operational dashboards (SLIs/SLOs, on-call readiness);
  • Capacity planning and performance troubleshooting for services and infrastructure components;
  • Maintain and improve Infrastructure-as-Code (Terraform), automation, and self-service tooling;
  • Work closely with developers to improve deployment safety (runbooks, rollbacks, change management);
  • Support Kubernetes-based workloads: reliability, scaling, and operational hygiene;
  • Contribute to security and compliance initiatives (access, secrets, baseline hardening) in collaboration with Security;


We expect:


  • Hands-on experience with GCP core services and cloud operations best practices;
  • Solid experience with Linux administration and troubleshooting;
  • Practical knowledge of Kubernetes and containerized workloads (Docker);
  • Experience with Terraform (modules/resources) or a similar IaC approach;
  • Experience with monitoring/observability tools (Cloud Monitoring, Prometheus, Grafana, ELK, etc.);
  • Understanding of incident management practices (triage, mitigation, RCA, follow-ups);
  • Scripting for automation (Bash / Python / Go) at a practical level;
  • English level sufficient for written communication and cross-team collaboration (B2+);
  • Location: only Poland;


Nice to have:


  • Experience with GitOps/Argo CD and CI/CD pipelines;
  • Knowledge of cloud networking fundamentals (VPC, routing, IAM, firewalls);
  • Experience with cost optimization / FinOps practices;
  • Experience supporting large-scale databases (MySQL) or data warehouses (Snowflake/BigQuery);
  • Familiarity with security practices (secrets management, encryption, least-privilege access);


What do we offer:


  • Fully paid annual leave in accordance with Polish labor law;
  • Holiday allowance to support your time off;
  • Paid sick leave (80% of the daily base salary);
  • Access to a discounted group life insurance program;
  • Private medical insurance (Medicover);
  • Monthly budget for sports and cultural activities (ZFŚS fund). Option for partial funding of a Multisport card;
  • Access to an online language learning platform;
  • Professional development opportunities covered by the company (seminars/training/conferences/etc.).