Site Reliability Engineer (SRE)

  • Porto
  • Devexperts
Company DescriptionDevexperts has been working for nearly two decades consulting and developing for the financial industry. We solve complex technological challenges facing the most well-respected financial institutions worldwide.By becoming a part of Devexperts, you’ll become a part of a company that fosters self-improvement and actively seeks out-of-the-box ideas. Our teams work together to create the next generation of financial software solutions. We welcome all candidates who believe, as we do, that innovation is grounded in education.Job DescriptionWe are looking for a Senior Site Reliability Engineer (SRE) to fill the open position in a team that develops and supports proprietary trading platforms for large scale clients. You will help the existing team to ensure access to various markets to end users from a lot of countries. You will be responsible for maintaining availability, automating release/deploy process, seamless monitoring, and alerting of all the solutions.work closely with developers for prototyping, and designing new features as part of the infrastructuredeploy, install, configure and maintain sophisticated Trading/Finance and related softwareconfigure bare metal & сloud instances by using Infrastructure as Codemake key decisions for scalability, reliability and accessibilityinstall and manage in-house developed and external well-known monitoring systemsdesign, deploy and configure cloud-based servers and networks provision servers and storage, configure firewalls, VPN, monitoring, etc.administrate UNIX/Cloud infrastructure – installation, configuration and maintenancework with the Nexus and GIT repositoriesQualifications5+ years of experience in UNIX/Linux administration5+ years of experience in Networkingexperience as an SRE or DevOpsstrong experience with OS-level administration on Linux and/or UNIXhands-on scripting experience with Bash, Python, and/or Groovyexperience with configuring TeamCity CI/CD pipelinesIAAS solutions using Ansible (AWX), Terraformexperience with Docker containers orchestrating (K8S/OpenShift/Hashicorp)know how to read and analyze errorsin-depth knowledge of TCP/IP and ISO/OSI stackexperience with monitoring and logging tools (Zabbix, Elasticsearch, or OpenSearch, Grafana, Kibana, Dynatrace, Prometheus, etc.)experience in working with Apache, Nginx, HAproxy, Envoy, etcstrong ability to solve problems using code and scriptingunderstanding of ITIL processes and routinesExcellent English (written and verbal)Also the following knowledge or experience will be to your advantage:experience with SQL-like command languageexperience with Ansible (AWX)knowledge of Java programming languageexperience with trading/exchange/risk management software usageexperience with Atlassian software (JIRA, Confluence, FishEye, etc.)