Site Reliability Engineer, Security Service Edge

Check Point Software Technologies Boston , MA 02298

Posted 6 days ago

Why Join Us?

As the world's leading vendor of Cyber Security, facing the most sophisticated threats and attacks, we've assembled a global team of the most driven, creative, and innovative people. At Check Point, our employees are redefining the security landscape by meeting our customers' real-time needs and providing our cutting-edge technologies and services to an ever-growing customer base.

Check Point Software Technologies has been recognized by Forbes as one of the World's Best Places to Work four years in a row (2020-2023), ranking among the top 50 companies across the globe in the IT category. Check Point has also been named to Forbes' list of World's Top Female-Friendly Companies. If you want to make the world a safer place and join an award-winning company culture - you belong with us.

In this role, you will investigate complex production issues, improve our system resilience and monitoring coverage, support our customer-facing teams with complex technical investigations and implementations and automate methods and procedures to reduce workload and improve stability,

If you are an experienced SRE/DevOps/Network expert, working in a large-scale infrastructure environment and looking to boost up your career by working for one of the industry leaders of the evolving cyber security industry, your place is with us.

Key Responsibilities

  • Lead investigation and collaborate with other group experts to investigate complex cross-function production issues

  • Maintain 100% Monitoring coverage, including building monitoring strategy that alerts on symptoms rather than on outages

  • Reduce workload and improve uptime and SLA response time by implementing automation processes for production issues

  • Act as the R&D extension in North America supporting production critical issues during North American business hours

  • Perform advanced troubleshooting of complex network problems and recurring platform issues

  • Support Account Managers and Customer Success team with complex implementations/strategic implementations

  • Design, build, and maintain core infrastructure that enables growth

Qualifications

  • Strong Experience with AWS

  • Strong Experience with observability and monitoring systems (Datadog, Prometheus, Grafana, etc.) Including building and designing advance monitoring.

  • Working experience in large-scale network and system engineering environments (ISP, Cloud Providers)

  • Experience with Linux system administration.

  • Experience with networking technologies and protocols (TCP/IP, LAN, NAT, BGP, VPN, DNS, iSCSI)

  • Experience with Configuration Management and IaC tools (Ansible, Terraform)

  • Experience with coding complex automation and runbooks

  • Good familiarity with virtualization environments (Proxmox, OpenStack)

  • Scripting experience with Bash, Python, or similar

  • Proficiency with virtualized and containerized environments (ECS / Kubernetes)

  • Experience with Hashicorp tools (Consul, Vault, Nomad) - An advantage.

  • Proven network debugging and problem-solving skills

Must be eligible to work in the United States without sponsorship now or in the future.

EOE M/F/Veterans/Disabled

Apply for this Position


icon no score

See how you match
to the job

Find your dream job anywhere
with the LiveCareer app.
Mobile App Icon
Download the
LiveCareer app and find
your dream job anywhere
App Store Icon Google Play Icon
lc_ad

Boost your job search productivity with our
free Chrome Extension!

lc_apply_tool GET EXTENSION

Similar Jobs

Want to see jobs matched to your resume? Upload One Now! Remove

Site Reliability Engineer, Security Service Edge

Check Point Software Technologies