The Cloud Platform Engineer focuses on building and operating the environment and ensures a reliable, well-governed, high-performance platform. This role also owns infrastructure, access, tooling, observability, and standards across the full stack.
Responsibilities
Main Accountabilities
Provisions, configures, and tunes Databricks workspaces, Unity Catalog, and Apache Spark cluster settings (memory, executors, autoscaling) for automated, cost-efficient compute.
Defines and enforces Spark usage standards, query patterns, resource quotas, and Delta Table partitioning guidelines to optimize storage and prevent cluster contention.
Administers and tunes Airflow environments (upgrades, plugins, monitoring), enabling automated DAG authoring and workflow management.
Owns IAM roles, policies, and least-privilege access management across all platform services to ensure security.
Maintains and expands the Atlan data catalogue by onboarding sources, enforcing metadata standards, and supporting governance workflows.
Manages Git-based repositories (CodeCommit), branching strategies, and CI/CD pipelines, leveraging Kiro for accelerated documentation and AI-assisted workflows.
Builds self-service tools and monitors platform health, SLAs, and costs, acting as the primary contact for incidents and technical escalations.
Requirements
Education, Skills and Experience
Bachelor’s degree in Computer Science, Software Engineering or related field.
1+ years of experience in a cloud platform, infrastructure, or site reliability engineering role.
Experience with AWS fundamentals across core services such as S3, IAM, VPC, EC2, and CloudWatch, as well as cost management.
Practical knowledge of Databricks administration, including workspaces, clusters, and Unity Catalog.
Experience running Airflow in production, specifically focused on managing the environment.
Understanding of Apache Spark internals such as the execution model, memory management, and performance tuning at a platform level.
Understanding of Delta Lake or Delta Tables at the storage and governance level.
Familiarity with data cataloguing tools such as Atlan or equivalent.
Infrastructure-as-code mindset using tools such as Terraform, CDK, or similar.
Ability to balance platform standards with developer experience and velocity.
Competencies
Excellent interpersonal skills with the ability to communicate effectively at all levels
Strong analytical and problem-solving capabilities
Results-oriented mindset
Effective time management with the ability to multitask and prioritize work
Benefits
Bonus structure
Certifications and training
Continuous education
Private health insurance
Remote work options
Here’s what you can look forward to as part of the #CepalTeam:
Competitive Compensation: We offer an attractive salary, annual performance-based bonuses, and a monthly meal allowance through our ticket restaurant card
Health: Private medical insurance is provided for you and your family
Family Support: Monthly financial allowance for early education (nursery) and coverage of expenses for children with neurodiversity or disabilities, including therapeutic swimming, music therapy, horse riding, and parental support
Flexible Work Model: Our hybrid approach offers a level of remote work flexibility that supports work-life balance while preserving strong collaboration and team spirit
Modern Workspaces: Contemporary offices designed to support comfort, health, and productivity, with fully equipped workstations, quiet areas, on-site restaurant, and group fitness sessions
Lifelong Learning: Cepal supports continuous learning through access to e-learning platforms and structured professional development programs
Career Progression: We are committed to your growth, offering a clear development path supported by feedback, mentoring, and personalized learning plans
Make a Difference: Get involved in regular wellbeing, ESG and volunteering initiatives that reflect our values and foster a sense of purpose and community