We are a diverse team from around the world working together on a mission to make DuckDuckGo the world's most trusted search engine, and we want your help!
Join us as a site reliability engineer at DuckDuckGo and become part of the team shaping our growing infrastructure. As a member of our Operations team you will work together with your peers to keep the search engine online, stable and fast. You will leverage your expertise to challenge our assumptions about the reliability of our deployment and the effectiveness of our processes as we grow.
As a member of a global Operations team you will sometimes be expected to work inconvenient hours for on-call responsibilities and synchronous work with your team. The ability to coordinate with people across time zones is an expectation of the role.
What you will do:
- Lead high complexity projects from scoping to deployment to production
- Develop effective tools, alerts, and responses to identify and address reliability risks
- Work closely with search engineers to triage production issues and determine appropriate remediation including code changes and performance considerations
- Share the burden of on call responsibilities - collaborating with other engineers to triage and fix reliability issues that come up in production and autonomously put out fires that may come up
- Help determine the future technical direction of our deployment with an effort to improve reliability and performance
What we are looking for:
- Significant experience as a site reliability engineer (around 2+ years).
- Ability to root cause sources of instability of high-traffic, distributed systems.
- Experience with configuration and troubleshooting of Linux and NGiNX.
- Strong understanding of reliability challenges of large-scale deployments.
- Moderate to advanced programming experience (preferably in a high level language like Perl or Python).
- Effective project management skills.
- Strong decision makers. You can make a decision when faced with competing priorities and limited information.
- Someone interested in the why not just the how. You like to analyze situations and won't be satisfied with a shallow analysis.
- Creative problem solvers and risk takers. You like to take initiative in pushing a project forward but can make adjustments based on team feedback.
- Strong communication skills. You can validate and communicate your decisions clearly.
More about the company:
As a remote employee at DuckDuckGo you will have the freedom to live anywhere in the world. You will be trusted with autonomy to execute your projects in collaboration with a team. This means that you must be self-directed and self-motivated to succeed.
If that seems awesome and you believe in our core values -- build trust, question assumptions, and validate direction -- you'll fit right in!
- As a global team we communicate with a variety of tools throughout the day (synchronous and asynchronous). You should feel comfortable with the intricacies of this type of work situation.
- Sometimes we meet up! You can expect to travel at least 2x a year: once for our all-hands meetup and another for a team retreat (each ~4-5 days)
- We want to have a major impact on raising the standard of trust online. To do this we believe in a focused approach, with company-wide objectives, and with each team member working on a single top priority at a time.
- Our work philosophy is built upon empowered project management. All team members have opportunities to run projects.
- All projects are run transparently, and we encourage everyone to participate in areas of interest throughout the company. Anyone and everyone can (and should) ask questions and offer feedback around the product and internal projects.
- We try to exemplify our values (build trust, question assumptions, and validate direction) in everything we do.
If you think you might thrive in this environment, we would love to hear from you!