Responsibilities

  • Run the production environment by monitoring availability
  • Build software and tools to manage platform infrastructure and applications
  • Improve reliability, quality, and time-to-market of our suite of software solutions
  • Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve
  • Implement solutions that are automated, make complex technical processes more streamlined and efficient
  • Provide primary operational support and engineering for multiple large distributed software applications

Requirements

  • Bachelor’s degree in computer science or other highly technical, scientific discipline
  • Ability to program (structured and OO) with one or more high level languages, such as Python, Java, C#, and JavaScript
  • Experience with infrastructure technologies like Operating Systems (Windows and Linux), networking, storage, virtualisation
  • Familiar with testing automation tools
  • Have an urge for delivering quickly and iterating fast
  • A proactive approach to spotting problems, areas for improvement, and performance bottlenecks
  • Previous success in leading large software engineering teams of more than 40 engineers with production support
  • Coding experience beyond simple scripts
  • Excellent communication
  • Thriving as a member of a team
  • Excel under pressure
  • The ability to think fast
  • A natural problem solver
  • Deep knowledge of below or more technical competencies
  • Great software engineer and able to code in resolving defects or vulnerabilities of our systems
  • Use infrastructure automation tools such as Chef or Ansible to efficiently manage our infrastructure
  • Implement “Infrastructure as Code” using Terraform and CI/CD for automation
  • Load balancing and high availability architecture of application including Proxies and CDN through the use of F5
  • Openshift and containerizing our system
  • Administer and manage high-availability, high-performance Microsoft SQL Server or Oracle cluster.
  • Monitoring and Metrics in Dynatrace, ELK or eG and integrations with Dynatrace / ITSM
  • Logging infrastructure
  • Backend storage management and scaling
  • Disaster Recovery and High Availability strategy