Your daily routine would include:
- Implement solutions to monitor service health and performance, investigate root causes when issues arise, analyze logs, and apply fixes within SLA guidelines;
- Handle incident reports and service requests through a ticketing platform (like ServiceNow) and collaborate with stakeholders to minimize service impact;
- Plan and coordinate technical changes, upgrades, or new service introductions;
- Develop or customize routines and scripts (PowerShell, Python, or similar) to ease operational workloads and enable automation;
- Create custom Management Packs in SCOM with PowerShell;
- Provide input and guidance for designing and optimizing enterprise monitoring solutions in collaboration with service managers and technical teams;
- Perform server and platform administrative activities, such as installations, deployments, configurations, upgrades, patching, and maintenance across PROD and non-PROD environments of the SCOM platform and B2B customer solutions.
We’ll know you can make it if you have:
- Experience in enterprise monitoring solutions (like SCOM, Zabbix, Checkmk, or similar);
- Strong experience with scripting in PowerShell, Python or similar;
- Ability to identify and resolve faults, document processes, and drive continual operational improvements;
- Experience with Microsoft automation technologies and system management tools such as SCCM or MECM is an advantage;
- Experience as a Windows System Administrator is an advantage;
- A degree in IT, Telecommunications, or similar;
- Fluency in English, both written and spoken.
Nice-to-haves:
- Experience with Python;
- Experience handling SSL certificates;
- Familiarity with Microsoft Azure.