Dellent is a Portuguese consulting, technology services and outsourcing company, focused mainly on IT and Telecommunication services, aiming to create teams capable of delivering highly qualified solutions.
From consulting to Nearshore services, our 150 consultants work closely with our clients to bring our clients' projects to life and to the next level.
We are always looking for bright and driven people who have a penchant for technology and want to hone their skills.
Site Reliability Engineering is responsible for designing, building and maintaining the solutions and infrastructure that ensure our cloud service maintains availability, scalability, performance and efficiency. The ideal candidate bridges the gap between software engineering and technical operations, has excellent communications skills and is an energetic detail-oriented team player who enjoys working in a fast-paced environment.
Develop and maintain an understanding of the software development and deployment pipeline. This includes initial software architecture and development, the continuous integration workflow, deployment of software through the pipeline to production, and monitoring of software and infrastructure in production.
Design, build and maintain automation that powers our cloud-based product deployments and auto-scaling. Automation systems to deploy must do so reliably with zero downtime and the ability to easily rollback.
Design and implement monitoring and alerting that are actionable.
Work closely with software development and testing teams assuring that software is reliable and scalable.
Build and maintain supporting infrastructure for AWS (or other) cloud environments through an “infrastructure as code” methodology.
Perform and automate system administration services including installation, configuration, maintenance, and disaster recovery.
Identify emergent issues that indicate scalability and reliability problems, working with development and operations teams to help drive solutions.
Maintain SLA compliance.
Assist with Root Cause Analysis of service interruptions.
Automation, auditing, and other tooling for security, compliance, and resource usage.
Strong distributed systems knowledge and transactional services design, implementation, and/or operation experience.
Software design fundamentals in data structures, algorithm design and performance analysis.
Experience writing and maintaining complex software in at least 2 of the following languages: Python, Go, NodeJS, Bash, Lua, Ruby, Java.
Experience with configuration and deployment management tools (Puppet, Terraform, Chef).
Experience implementing servers and services in a Linux environment.
In-depth understanding of TCP, UDP, IP and HTTP protocols.
Experience with operations and automated provisioning in cloud-based environments (AWS preferred)
Experience with SQL (e.g. PostgreSQL) and NoSQL databases (e.g. Redis, Elasticsearch).
Experience with Distributed Version Control software.
Integration in a dynamic, experienced and friendly team;
Technical, behavioural and linguistics training opportunities;
If you are interested in challenging work and solving complex problems through technology - then Dellent is right for you!
We are looking forward to your application, apply here.