Senior Site Reliability Engineer

Profitero

23 days ago

On-site

About Profitero+

Profitero+ is the leading digital commerce company, trusted by more than 4,000 brands worldwide. We help brands break down silos and turn data into decisive action through intelligence-driven, end-to-end solutions that unify media, content, operations and strategy. Powered by advanced AI, robust digital shelf analytics across 1,400+ retailers in 70 countries and unmatched expertise from digital commerce specialists in 15 global hubs, our integrated solutions help brands accelerate profitable growth.

Come be a part of our fast-paced, entrepreneurial culture and next stage of growth.

Location: Remote role based in Poland

About the team:

You will be part of the Infrastructure/Platform function responsible for reliability, availability, performance, and operational readiness across our cloud environments. Our teams are distributed internationally, with key members located in Eastern Europe.

Key responsibilities:

Improve reliability and availability of critical services (production focus);
Participate in incident response and post-incident reviews (RCA), and drive corrective/preventive actions;
Build and maintain monitoring/alerting and operational dashboards (SLIs/SLOs, on-call readiness);
Perform capacity planning and performance troubleshooting for services and infrastructure components;
Maintain and improve Infrastructure-as-Code (Terraform), automation, and self-service tooling;
Work closely with developers to improve deployment safety (runbooks, rollbacks, change management);
Support Kubernetes-based workloads: reliability, scaling, and operational hygiene;
Contribute to security and compliance initiatives (access, secrets, baseline hardening) in collaboration with Security.

We expect:

Hands-on experience with GCP core services and cloud operations best practices;
Solid experience with Linux administration and troubleshooting;
Practical knowledge of Kubernetes and containerized workloads (Docker);
Experience with Terraform (modules/resources) or a similar IaC approach;
Experience with monitoring/observability tools (Cloud Monitoring, Prometheus, Grafana, ELK, etc.);
Understanding of incident management practices (triage, mitigation, RCA, follow-ups);
Practical scripting skills for automation (Bash, Python, or Go);
English level sufficient for written communication and cross-team collaboration (B2+);
Residence in Poland (the role is open only to candidates based in Poland).

Nice to have:

Experience with GitOps/Argo CD and CI/CD pipelines;
Knowledge of cloud networking fundamentals (VPC, routing, IAM, firewalls);
Experience with cost optimization / FinOps practices;
Experience supporting large-scale databases (MySQL) or data warehouses (Snowflake/BigQuery);
Familiarity with security practices (secrets management, encryption, least-privilege access).

What we offer:

Fully paid annual leave in accordance with Polish labor law;
Holiday allowance to support your time off;
Paid sick leave (80% of the daily base salary);
Access to a discounted group life insurance program;
Private medical insurance (Medicover);
Monthly budget for sports and cultural activities (ZFŚS fund), with an option for partial funding of a Multisport card;
Access to an online language learning platform;
Professional development opportunities covered by the company (seminars/training/conferences/etc.).
