Senior Cloud Engineer (AWS/GCP) at Lasso Informatics Inc Lasso Informatique Inc
About This Position
Lasso Informatics is a SaaS start-up with a live research data management and analysis platform that brings together multi-modal (imaging, genetics, behavioral, and biosample) data for large-scale studies. Thousands of researchers across the globe rely on our platform today, and we are rapidly iterating and improving to push the boundaries of what is possible in research data management.
We live to innovate, and empower scientists to focus on the science, not the technology, leading to a faster time to science and cure.
Our team is incredibly diverse both by background and expertise, and that is not by accident. We believe that the most creative and powerful solutions come from different ways of thinking about the world. You will be working in an inspiring ecosystem alongside world-renowned professionals in medicine, physics, engineering, imaging, epidemiology, software development, and genetics. We thrive on empowering our colleagues to be thought leaders and innovate fresh new solutions for an exciting and rapidly changing field.
About the Role
This is a Senior Cloud Engineer role with a strong emphasis on reliability, security, cost control, and operational excellence in production cloud environments.
You will own and operate mission-critical cloud infrastructure across multiple environments and customers, ensuring systems are secure, compliant, observable, and scalable. While this role collaborates closely with engineering teams, it is not focused on CI/CD pipelines or developer tooling. Instead, it centers on building and maintaining robust, well-governed cloud systems that support regulated research workloads.
The ideal candidate brings deep hands-on cloud and systems expertise, paired with a disciplined, process-driven mindset and a track record of improving stability, cost efficiency, and service quality at scale
Key Responsibilities
Cloud Environment & Customer ManagementDesign, operate, and maintain multiple cloud environments (development, staging, production) across AWS and GCP.
Manage infrastructure for multiple customers with distinct configurations and compliance requirements.
Ensure consistency across environments and actively prevent configuration drift.
Plan and execute patches, upgrades, and infrastructure changes with minimal disruption.
Define and enforce clear operational rules of engagement for access, changes, and incident response.
Perform advanced Linux systems administration across cloud environments, including patching, hardening, troubleshooting, and performance tuning
Operate and evolve production Kubernetes platforms across AWS (EKS) and GCP (GKE), including upgrades, scaling, security hardening, and lifecycle management
Administer and tune PostgreSQL databases (Aurora RDS and managed PostgreSQL services), supporting performance, reliability, and data integrity
Evaluate, standardize, and operationalize cloud-native services across AWS and GCP to improve reliability, security, scalability, and operational consistency
Ensure cloud infrastructure adheres to defined standards while accommodating platform-specific best practices
Own observability across cloud environments using Datadog, spanning metrics, logs, and APM.
Design dashboards, alerts, and reports to enable proactive monitoring across AWS and GCP workloads.
Drive cost-awareness and optimization initiatives across both cloud platforms, balancing performance, reliability, and regulatory requirements.
Partner with engineering and research teams to identify inefficiencies and guide responsible cloud usage.
Collaboration & Experimental Environments
Collaborate with external institutions’ system and cloud teams to support joint projects and integrations.
Support pilot projects, proof-of-concept environments, and future product initiatives.
Act as a technical bridge between cloud operations, research teams, and internal engineering.
Process, Documentation & Audit Readiness
Develop and maintain Methods of Procedure (MOPs) and Standard Operating Procedures (SOPs).
Maintain detailed documentation in Confluence to support audit readiness and operational continuity.
Use Jira to manage cloud operations backlogs, patching cycles, and infrastructure changes.
Champion a documentation-first, process-driven culture for cloud operations.
Security & Compliance
Administer endpoint and workload protection tooling (e.g., CrowdStrike).
Own vulnerability management end-to-end: identification, remediation, validation, and documentation.
Apply cloud security best practices for IAM, secrets management, encryption, and network segmentation.
Support ISO, FedRAMP, NIST, SOC 2, and other compliance frameworks through disciplined cloud operations.
Leadership & Forward Planning
Provide technical leadership in cloud reliability and operations.
Balance stability with innovation, ensuring cloud infrastructure is prepared for future growth.
Influence cloud strategy and standards while maintaining operational rigor.
Required Skills & Experience
7+ years in cloud infrastructure, systems engineering, or cloud operations roles
Strong, hands-on Kubernetes administration experience in both GKE (required) and EKS
Strong PostgreSQL administration experience (Aurora RDS and/or managed PostgreSQL services)
Deep Linux systems expertise (patching, hardening, troubleshooting)
Proven experience operating multi-environment, multi-customer platforms across AWS and GCP
Experience with configuration management tools (Ansible, Puppet, or equivalent)
Strong process orientation with experience writing MOPs and SOPs
Comfortable using Jira, Confluence, and Bitbucket for planning and documentation
Detail-oriented with a strong commitment to operational discipline
Experience with Keycloak or similar identity platforms
Nice to Have
Infrastructure-as-Code experience (Terraform, CloudFormation)
Experience in regulated environments (ISO, FedRAMP, SOC 2, HIPAA, NIST 800-171)Experience operating Kubernetes workloads across multiple cloud providers
Familiarity with Active DirectoryExperience with secure research compute platforms (SLURM, Open OnDemand)
Experience with Globus (Auth, Share, Compute)
Deep Datadog experience (dashboards, monitors, logs, APM, synthetic testing)
Experience working with external research institutions or partners
Familiarity with CrowdStrike or similar endpoint security tooling
Competitive salary and benefits package
In-office work culture with required presence Tuesday through Thursday
Opportunities for leadership and professional growth
Collaborative team committed to innovation, quality, and scientific impact
Access to training resources and ongoing professional development