DevOps Engineer - Platform Operations

New York, NY
Full Time
Mid Level

Systems Engineer - Platform Operations (DevSecOps)

Job Summary:

Caden’s mission is to create an equitable and fair data economy online by giving users ownership of their personal data, the ability to securely control it, and various ways to put it to work to create value with brands they trust, while always preserving their privacy.

We’re led by industry veterans, backed by powerhouse investors (almost $25M total investment), and crewed by the brightest minds in the game. We exist to make the internet a better place for all.

As we continue to expand, we are seeking an additional dynamic and results-driven Platform Operations (DevOps) Systems Engineer to help build and scale Caden’s platform. This position will report to the Lead Engineer - Platform Operations and will be instrumental in building and owning the software development effort for the Caden platform.

Your skills will add to the team’s expertise in designing, implementing, and maintaining the systems and infrastructure that form the backbone of Caden’s platform. You will work closely with members of Caden’s Engineering team to troubleshoot issues, optimize performance, and deploy updates, ultimately ensuring high availability, security, and scalability. By monitoring system health, analyzing performance metrics, and identifying potential vulnerabilities, you will contribute to the overall stability and reliability of the platform, enabling the company to deliver uninterrupted services, enhance customer satisfaction, and drive business growth. 

What You’ll Do:

  • Collaborate with software and data engineers to design, implement, and maintain our systems infrastructure.
  • Utilize Terraform to automate the provisioning, configuration, and management of infrastructure resources across multiple cloud platforms.
  • Implement and configure monitoring and alerting solutions using Datadog to ensure the health and performance of our systems.
  • Work with Helm to manage deployments and updates of containerized applications within Kubernetes clusters.
  • Assist in the deployment and management of services on public cloud platforms such as AWS and GCP.
  • Contribute to the development of observability practices and tools to enable effective monitoring, logging, and debugging of distributed systems.
  • Administer and support distributed systems, ensuring their reliability, performance, and scalability.
  • Troubleshoot and resolve infrastructure-related issues, ensuring minimal impact on operations.
  • Collaborate with cross-functional teams to ensure the seamless integration of new services and applications into our existing infrastructure.
  • Stay up-to-date with emerging technologies and industry trends, continuously improving your technical skills and knowledge.

What You’ve Done:

  • Strong understanding of infrastructure-as-code (IaC) principles and experience using Terraform for infrastructure provisioning and management.
  • Familiarity with monitoring and observability tools such as Datadog to track system performance, troubleshoot issues, and ensure scalability.
  • Proficiency in managing containerized applications using Helm within Kubernetes clusters.
  • Experience with public cloud platforms, preferably AWS and GCP, including deploying and managing services.
  • Knowledge of distributed systems concepts, best practices, and hands-on experience with their administration.
  • Experience with Kafka for building and managing distributed streaming platforms.
  • Familiarity with MongoDB or other NoSQL databases, including administration and performance tuning.
  • Strong problem-solving skills and the ability to analyze and resolve complex infrastructure issues.
  • Excellent communication and collaboration skills, with the ability to work effectively in a team-oriented environment.
  • Self-motivated and eager to learn new technologies and stay updated with industry advancements.
  • Bachelor's degree in Computer Science, Engineering, Information Technology or a related field.

Bonus points for:

  • Any experience with Kafka or Confluent Cloud.
  • Relevant certifications such as AWS Certified Solutions Architect, GCP Cloud Engineer, or Kubernetes certifications.
  • Familiarity with additional tools and technologies related to infrastructure automation, containerization, and distributed systems.

Why Caden?

  • Join a high-growth startup that is at the forefront of innovation!
  • Opportunity to make a significant impact on the company's strategy and growth trajectory.
  • Collaborative and inclusive work environment that encourages innovation and growth.
  • Competitive compensation package.
  • A wide range of health & commuter benefits.
  • Hybrid work arrangements.
  • Flexible PTO!
  • In-office perks such as a plethora of snacks & drinks, a pet-friendly environment, and more!

Salary: $120,000 - $135,000 base. Salary may vary based on experience.

We offer a competitive salary, equity, hybrid work arrangements, and opportunities for professional development.

This is a hybrid role, working 3 days a week out of our downtown Manhattan office.

** There is currently no relocation and/or visa (immigration) assistance provided for this position.
