Senior Databricks Data Engineer

Other Jobs To Apply

No other job posts for this day.

<span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;"><b>Overview:</b></span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">We are seeking a <b>Senior Databricks Data Engineer</b> to design, build, and operate a Data & AI platform with a strong foundation in the Medallion Architecture (raw/bronze, curated/silver, and mart/gold layers). This platform will orchestrate complex data workflows and scalable ELT pipelines to integrate data from enterprise systems such as PeopleSoft, D2L, and Salesforce, delivering high-quality, governed data for machine learning, AI/BI, and analytics at scale.</span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">You will play a <b>senior technical role in guiding engineering standards, reusable patterns, platform reliability, and production readiness across the Data & AI platform.</b></span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">You will play a critical role in engineering the infrastructure and workflows that enable seamless data flow across the enterprise, ensure operational excellence, and provide the backbone for strategic decision-making, predictive modeling, and innovation.</span></span></span><br><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;"><b>Responsibilities:</b></span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;"><b>1. Data & AI Platform Engineering (Databricks-Centric):</b></span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Provide senior-level technical leadership in the design, optimization, and standardization of Databricks engineering patterns across the Data & AI platform.</span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Design, implement, and optimize end-to-end data pipelines on Databricks, following the Medallion Architecture principles.</span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Build robust and scalable ETL/ELT pipelines using Apache Spark and Delta Lake to transform raw (bronze) data into trusted curated (silver) and analytics-ready (gold) data layers.<br>This will be a Remote role</span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Operationalize Databricks Workflows for orchestration, dependency management, and pipeline automation.</span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Apply schema evolution and data versioning to support agile data development.</span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;"><b>2. Platform Integration & Data Ingestion:</b></span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Lead the design of reusable ingestion frameworks and integration patterns that support scalable, reliable, and governed data onboarding across enterprise systems.</span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Connect and ingest data from enterprise systems such as PeopleSoft, D2L, and Salesforce using APIs, JDBC, or other integration frameworks.</span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Implement connectors and ingestion frameworks that accommodate structured, semi-structured, and unstructured data.</span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Design standardized data ingestion processes with automated error handling, retries, and alerting.</span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;"><b>3. Data Quality, Monitoring, and Governance:</b></span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Establish senior-level data quality, observability, and governance practices to improve trust, reliability, lineage, and operational transparency across the platform.</span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Develop data quality checks, validation rules, and anomaly detection mechanisms to ensure data integrity across all layers.</span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Integrate monitoring and observability tools (e.g., Databricks metrics, Grafana) to track ETL performance, latency, and failures.</span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Implement Unity Catalog or equivalent tools for centralized metadata management, data lineage, and governance policy enforcement.</span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;"><b>4. Security, Privacy, and Compliance:</b></span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Provide senior technical guidance on secure data engineering practices, access-control patterns, and compliance implementation across Databricks and related cloud environments.</span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Enforce data security best practices including row-level security, encryption at rest/in transit, and fine-grained access control via Unity Catalog.</span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Design and implement data masking, tokenization, and anonymization for compliance with privacy regulations (e.g., GDPR, FERPA).</span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Work with security teams to audit and certify compliance controls.</span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;"><b>5. AI/ML-Ready Data Foundation:</b></span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Partner with data science and AI/ML teams to shape reusable, production-ready data engineering patterns that support scalable model development, deployment, and monitoring.</span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Enable data scientists by delivering high-quality, feature-rich data sets for model training and inference.</span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Support AIOps/MLOps lifecycle workflows using MLflow for experiment tracking, model registry, and deployment within Databricks.</span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Collaborate with AI/ML teams to create reusable feature stores and training pipelines.</span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;"><b>6. Cloud Data Architecture and Storage:</b></span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Contribute to senior-level cloud data architecture decisions related to data lake design, storage optimization, compute efficiency, security, and cost management.</span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Architect and manage data lakes on Azure Data Lake Storage (ADLS) or Amazon S3, and design ingestion pipelines to feed the bronze layer.</span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Build data marts and warehousing solutions using platforms like Databricks.</span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Optimize data storage and access patterns for performance and cost-efficiency.</span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;"><b>7. Documentation & Enablement:</b></span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Mentor engineers and promote senior-level engineering standards through documentation, code reviews, reusable frameworks, and knowledge-sharing practices.</span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Maintain technical documentation, architecture diagrams, data dictionaries, and runbooks for all pipelines and components.</span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Provide training and enablement sessions to internal stakeholders on the Databricks platform, Medallion Architecture, and data governance practices.</span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Conduct code reviews and promote reusable patterns and frameworks across teams.</span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;"><b>8. Reporting and Accountability:</b></span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Take ownership of complex data engineering deliverables, production issues, technical risks, and cross-team dependencies requiring senior-level judgment and coordination.</span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Submit a weekly schedule of hours worked and progress reports outlining completed tasks, upcoming plans, and blockers.</span></span></span><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Track deliverables against roadmap milestones and communicate risks or dependencies.</span></span></span><br><br><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;"><b>Required Qualifications:</b></span></span></span><ul><li><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Senior-level hands-on experience designing, building, optimizing, and supporting production-grade Databricks data engineering solutions in enterprise environments.</span></span></span></li><li><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">5+ years of hands-on experience with Databricks, Delta Lake, and Apache Spark for large-scale data engineering.</span></span></span></li><li><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Deep understanding of ELT pipeline development, orchestration, and monitoring in cloud-native environments.</span></span></span></li><li><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Experience implementing Medallion Architecture (Bronze/Silver/Gold) and working with data versioning and schema enforcement in enterprise grade environments.</span></span></span></li><li><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Strong proficiency in SQL, Python, or Scala for data transformations and workflow logic.</span></span></span></li><li><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Proven experience integrating enterprise platforms (e.g., PeopleSoft, Salesforce, D2L) into centralized data platforms.</span></span></span></li><li><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Familiarity with data governance, lineage tracking, and metadata management tools.</span></span></span></li></ul><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;"><b>Preferred Qualifications:</b></span></span></span><ul><li><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Experience serving as a senior engineer, technical lead, or mentor on enterprise data platform or Databricks implementation initiatives.</span></span></span></li><li><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Prior UMGC or USM experience preferred.</span></span></span></li><li><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Experience with Databricks Unity Catalog for metadata management and access control.</span></span></span></li><li><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Experience deploying ML models at scale using MLFlow or similar MLOps tools.</span></span></span></li><li><span style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Familiarity with cloud platforms like Azure or AWS, including storage, security, and networking aspects.</span></span></span></li><li><span lang="en-us" style="font-size:12pt;"><span style="line-height:115%;"><span style="font-family:Aptos, sans-serif;">Knowledge of data warehouse design and star/snowflake schema modeling</span></span></span></li></ul>

Back to blog

Common Interview Questions And Answers

1. HOW DO YOU PLAN YOUR DAY?

This is what this question poses: When do you focus and start working seriously? What are the hours you work optimally? Are you a night owl? A morning bird? Remote teams can be made up of people working on different shifts and around the world, so you won't necessarily be stuck in the 9-5 schedule if it's not for you...

2. HOW DO YOU USE THE DIFFERENT COMMUNICATION TOOLS IN DIFFERENT SITUATIONS?

When you're working on a remote team, there's no way to chat in the hallway between meetings or catch up on the latest project during an office carpool. Therefore, virtual communication will be absolutely essential to get your work done...

3. WHAT IS "WORKING REMOTE" REALLY FOR YOU?

Many people want to work remotely because of the flexibility it allows. You can work anywhere and at any time of the day...

4. WHAT DO YOU NEED IN YOUR PHYSICAL WORKSPACE TO SUCCEED IN YOUR WORK?

With this question, companies are looking to see what equipment they may need to provide you with and to verify how aware you are of what remote working could mean for you physically and logistically...

5. HOW DO YOU PROCESS INFORMATION?

Several years ago, I was working in a team to plan a big event. My supervisor made us all work as a team before the big day. One of our activities has been to find out how each of us processes information...

6. HOW DO YOU MANAGE THE CALENDAR AND THE PROGRAM? WHICH APPLICATIONS / SYSTEM DO YOU USE?

Or you may receive even more specific questions, such as: What's on your calendar? Do you plan blocks of time to do certain types of work? Do you have an open calendar that everyone can see?...

7. HOW DO YOU ORGANIZE FILES, LINKS, AND TABS ON YOUR COMPUTER?

Just like your schedule, how you track files and other information is very important. After all, everything is digital!...

8. HOW TO PRIORITIZE WORK?

The day I watched Marie Forleo's film separating the important from the urgent, my life changed. Not all remote jobs start fast, but most of them are...

9. HOW DO YOU PREPARE FOR A MEETING AND PREPARE A MEETING? WHAT DO YOU SEE HAPPENING DURING THE MEETING?

Just as communication is essential when working remotely, so is organization. Because you won't have those opportunities in the elevator or a casual conversation in the lunchroom, you should take advantage of the little time you have in a video or phone conference...

10. HOW DO YOU USE TECHNOLOGY ON A DAILY BASIS, IN YOUR WORK AND FOR YOUR PLEASURE?

This is a great question because it shows your comfort level with technology, which is very important for a remote worker because you will be working with technology over time...