Site Reliability Engineer - 2056

Sugar CRM Cleveland , OH 44114

Posted 4 weeks ago

About SugarCRM, Inc.

SugarCRM is a customer experience leader enabling businesses to create profitable customer relationships by delivering highly relevant, personalized experiences throughout the customer journey. We empower companies to strengthen existing customer relationships, create new ones through actionable insights and intelligent automation and better understand the customer at every stage of the journey. This enables businesses to accelerate demand generation, grow revenue, deliver superior customer care and increase loyalty. Our easy-to-use, intuitive platform makes customer experience easy and accessible for everyone, allowing marketing, sales and services professionals to focus on high-impact, value-adding activities that create customers for life.

Where do you fit?

To join our growing team, SugarCRM is currently seeking an experienced Site Reliability Engineer. This role can be based in one of our U.S.-based offices or remote.

Impact you will make in the role:

  • Manage applications in a CentOS Linux-based environment

  • Build repeatable infrastructures with Ansible

  • Develop and execute plans for rolling out new technologies rapidly

  • Improve monitoring infrastructure, build out data aggregation and alerting rules

  • Work closely with engineering to build scalable solutions

  • Triage tickets raised by our support organization and implement fixes

  • Support our private and public cloud environments and customers

  • Mentor other members of the Operations team

  • Participate in an on-call rotation

Expertise you will bring in:

  • BA/BS in Computer Science with Network Engineering or Information Systems emphasis, or equivalent work experience

  • Extensive knowledge with container orchestration technologies including Docker and Kubernetes

  • 6+ years experience in an Operations or Systems Administration role

  • Superior Unix administration skills

  • Extensive knowledge of common Internet Protocols

  • Extensive knowledge of TCP/IP

  • Experience with virtualization and cloud technologies

  • Hardware management, network switch and router administration experience

  • Experience with Apache, MySQL, and PHP in a production environment at scale

  • Strong knowledge of version control systems and hands-on experience with Git

  • Experience with writing code around infrastructure automation

  • Understanding of how to architect and implement highly available, scalable, and secure network in multiple cloud environments

  • Strong affinity and experience in working with continuous deployment and continuous integration environments

  • An understanding around micro-service architectures and the complexities around their deployments

  • Extensive programming experience in PHP, Ruby, Python, and Shell

  • Full stack troubleshooting and instrumentation experience

  • Extensive experience with IT automation technologies like Puppet, Salt, Chef, or Ansible

  • Experience with data aggregation, alerting, and reporting and supporting technologies such as Sensu and Graphite

Nice to haves:

  • Experience in an on-call rotation

  • Experience with Elastic Search or Apache Solr

  • Experience with Spinnaker and/or other CI/CD tools

  • Previous experience as a mentor or advisor

  • Current contributor to open source projects (a Github account you can link us to would be ideal)

Location: Cupertino, CA., Raleigh NC., Atlanta, GA, Orlando, FL, or Remote, U.S.

We are an Equal Opportunity, Affirmative Action employer. Minorities, women, veterans and individuals with disabilities are encouraged to apply.

Benefits and Perks:

Beyond a stellar work environment, friendly people, and inspiring, innovative work, we have some great benefits and perks:

  • Competitive salaries

  • Excellent medical, dental and vision coverage for you and your family, along with other benefit plans like 401(k) match

  • Unlimited Paid Time Off

  • Wellness Reimbursement Program

  • Onsite Programs, depending on location, such as Dry Cleaning, Car Washes, Massage, Yoga, and more

  • Career & Personal Development Program multi-platform

  • New Hire Onboarding for all new employees worldwide

  • Regular social events

  • Ownership is the greatest self-identity at SugarCRM - you are making an impact now

  • We are a merit-based company - many opportunities to learn, excel and grow your career

Note to Recruiters and Placement Agencies: SugarCRM does not accept unsolicited agency resumes. Please do not forward unsolicited agency resumes to our website or to any SugarCRM employee. SugarCRM will not pay fees to any third-party agency or firm and will not be responsible for any agency fees associated with unsolicited resumes. Unsolicited resumes received will be considered property of SugarCRM and will be processed accordingly.

icon no score

See how you match
to the job

Find your dream job anywhere
with the LiveCareer app.
Mobile App Icon
Download the
LiveCareer app and find
your dream job anywhere
App Store Icon Google Play Icon

Boost your job search productivity with our
free Chrome Extension!

lc_apply_tool GET EXTENSION

Similar Jobs

Want to see jobs matched to your resume? Upload One Now! Remove
Senior Site Reliability Operations Specialist


Posted 1 week ago

VIEW JOBS 8/17/2019 12:00:00 AM 2019-11-15T00:00 Founded in 1866, The Sherwin-Williams Company is a global leader in the manufacture, development, distribution, and sale of paints, coatings and related products to professional, industrial, commercial, and retail customers. The company manufactures products under well-known brands such as Sherwin-Williams®, Valspar®, HGTV HOME® by Sherwin-Williams, Dutch Boy®, Krylon®, Minwax®, Thompson's® Water Seal®, Cabot® and many more. Sherwin-Williams® branded products are sold exclusively through a chain of more than 4,100 company-operated stores and facilities, while the company's other brands are sold through leading mass merchandisers, home centers, independent paint dealers, hardware stores, automotive retailers, and industrial distributors. The company supplies a broad range of highly-engineered industrial and OEM coatings for wood and general industrial, coil, packaging, protective and marine, and transportation applications worldwide. Our 60,000 employees are diverse, innovative and passionate. With a variety of rewarding and challenging opportunities, Sherwin-Williams is a great place to find a career that takes you places. The eCommerce Senior Site Reliability Operations Specialist position focuses on detection, remediation and prevention of incidents ensuring maximum availability and reliability for our users and customers. The position requires strong business process and technology knowledge coupled with an operational excellence mindset that supports a best in class customer experience. The role is responsible for working with the business and technical teams to identify applications, features and integrations that should be monitored. Creating monitoring dashboard and generating reports to increase visibility for KPIs. Develop and enforce service level agreements (SLAs) with key stakeholders. Define and enforce critical incident response processes including handling communication to business and technical stakeholders.. Define problem management processes to prevent future incidents by prioritizing and completing root cause analysis (RCA). Ensure non-critical production issues are routed and triaged to the appropriate teams. Train on new features released to customers to understand site functionality. This is an individual contributor position. Essential Functions Incident Management * Initial incident management triage and ticket assignment to the appropriate team. Define and enforce across the IT E-Business COE critical incident response processes. * Define and enforce service level agreements between the provider and the customer that defines incident priorities, escalation paths, and response/resolution time frames. * Front line communication of high and critical incidents to key stakeholders. * Categorization of incident types for better data gathering and problem management. * Ensure Incident closure and documentation. * Interact with customer facing teams to address questions and problems. Problem Management * Define problem management processes to prevent future incidents by prioritizing and completing root cause analysis (RCA). * Work with business and IT staff to understand the impact and priority of the problem. * Oversee plan development and execution for problem resolution. * Ensure progress on problems being addressed. * Proactively work with engineers to identify and remediate single points of failure. Monitoring and Reporting * Identify applications, features, functions and integrations that should be monitored. * Partner with technical teams to ensure identified items are monitored. * Creation and oversight of monitoring dashboard. * KPI and Incident report generation for increased visibility. * Collaborate to define alerting thresholds are in place and relevant. Incidental Functions * Work in a hybrid waterfall / agile development environment. * Conduct research into new technologies, including tools, components, and frameworks. * Perform task management and reporting as necessary. * Provide tier 2, on-call support for critical deployment problems and issues. * Assist with other projects as may be required to contribute to efficiency and effectiveness of the work. * Work outside the standard office 7.5 hour workday as required. * Coordinate and drive disaster recovery activities as needed * Up to 10% travel is required. Position Requirements Formal Education & Certification * Bachelor degree or foreign equivalent in a related field or equivalent experience. Knowledge & Experience * 5 years IT experience. * 5 years IT operational support experience. * 5 years experience in customer service related work. * 2 years hands-on experience working with incident management. * A proven track record working with incident management tools and concepts. * Experience using agile project management tools (such as Rally or JIRA). * Working knowledge of Microsoft Office Suite. * Experience in developing operational metrics and data. Preferred Qualifications and Skills * Experience with Agile and Waterfall development and release practices. * Experience with application monitoring software. * Experience with IT KPI reporting. * Experience influencing and negotiating in a professional environment. * Ability to chair, facilitate and lead meetings. Personal Attributes * Strong written and oral communications skills. * Proven ability and initiative to learn and research new concepts, ideas, and technologies quickly. * Strong systems/process orientation with demonstrated analytical thinking, organization skills and problem solving skills. * Ability to work in a team-oriented, collaborative environment. * Ability to quickly pick up new tools and technologies. * Willingness and ability to train and teach others. * Ability to facilitate meetings and follow up with resulting action items. * Ability to prioritize and execute tasks in a high-pressure environment. * Strong presentation and interpersonal skills. * Ability to work effectively in a multi-cultural environment, and to lead and influence cross-organizationally with and without direct authority. * Ability to effectively move forward on tasks even with ambiguous or changing requirements. Must be legally authorized to work in country of employment without sponsorship for employment visa status now or in the future. Equal Opportunity Employer. All qualified candidates will receive consideration for employment and will not be discriminated against based on race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, disability, age, pregnancy, genetic information, creed, citizenship status, marital status, or any other consideration prohibited by law or contract. VEVRAA Federal Contractor requesting priority referral of protected veterans. Sherwin-Williams Cleveland OH

Site Reliability Engineer - 2056

Sugar CRM