Data Engineer Job at VDart Inc, Remote

RWJNTlZicktiMjBUc202ekFycmFjSFgrSUE9PQ==
  • VDart Inc
  • Remote

Job Description

Title: Data Engineer

Location: Remote

Duration: 6 Months

Work Description:

We are in the process of migrating off CRMA Data Manager by rewriting queries and implementing the required data transformations in AWS. This platform modernization effort includes working through a backlog of datasets that must be migrated to AWS and transformed to meet current and future reporting needs.

Business Knowledge:

Limited business knowledge is needed.

Technical Skills:

Must-Have Technical Skills:

  • AWS Data Services (Hands-on)
  • S3: Data lake design, partitioning strategies, lifecycle management
  • IAM: Roles & policies, least-privilege access, cross-account access
  • Glue / EMR: Crawlers, Data Catalog, ETL job development
  • Athena: Querying data lakes with performance and cost optimization
  • Lake Formation: Basic governance and permission management

Compute & Processing

  • Apache Spark (PySpark): Batch processing, performance tuning, joins, partitioning
  • Python: Production-grade coding (packaging, testing, logging, type hints)
  • SQL: Advanced querying (window functions, query optimization, data modeling support)

Orchestration & Scheduling

  • Airflow / MWAA / AWS Step Functions
  • DAG design
  • Retry mechanisms
  • SLA management
  • Backfills
  • Data Warehousing & Modeling
  • Redshift / Snowflake (on AWS): Fundamentals and performance considerations
  • Dimensional Modeling: Star/Snowflake schema design

ETL/ELT Patterns:

  • CDC (Change Data Capture)
  • SCD (Slowly Changing Dimensions)
  • Idempotent data pipelines
  • Data Reliability & Observability
  • Data quality frameworks: Great Expectations / Deequ (or equivalent)
  • Data reconciliation & validation
  • Monitoring & observability: CloudWatch logs, metrics, alerts

DevOps & Delivery

  • Version Control: Git, branching strategies, code reviews
  • CI/CD: Data pipeline automation (e.g., GitLab CI/CD)
  • Infrastructure-as-Code: OpenTofu / CloudFormation for AWS resource deployment

Security & Compliance

  • Encryption: At rest & in transit (KMS)
  • Secrets management: AWS Secrets Manager / SSM
  • Networking fundamentals: VPC, private subnets, endpoints (data access control)

Role Expectations (Hands-on Experience Required):

  • Designed, developed, and maintained production-grade ETL pipelines using AWS Glue (PySpark)
  • Built scalable data ingestion pipelines from S3, databases, and streaming sources into S3 data lakes
  • Implemented complex transformations and joins in PySpark, optimizing performance (partitioning, broadcast joins, caching)
  • Developed incremental and idempotent pipelines, including handling CDC and SCD
  • Automated schema discovery using Glue Crawlers and Data Catalog
  • Tuned Glue Spark jobs for performance, concurrency, and cost efficiency
  • Integrated pipelines with orchestration tools like Airflow (MWAA) or Step Functions
  • Collaborated with data teams to load curated data into Redshift / Snowflake / Iceberg for analytics
  • Implemented data quality checks using built-in validations or tools like Great Expectations / Deequ
  • Applied AWS security best practices (IAM roles, KMS encryption, secure data access)
  • Contributed to CI/CD pipelines for Glue job deployment using Git and IaC tools
  • Monitored pipelines using CloudWatch, ensuring reliability and quick incident resolution
  • Worked closely with stakeholders to define data contracts, SLAs, and business expectations

Key Skills: Data Engineer, AWS Glue, IAM, ETL, Athena, PySpark

Job Tags

Full time

Similar Jobs

Confidential

Relationship Coordinator - NYC Job at Confidential

 ...Position Overview Job Title Relationship Coordinator Corporate Title Analyst Location New York, NY Overview As a...  ...services, guidelines and procedures and be able to research account transactions records to resolve discrepancies and answer questions or... 

REEDS Jewelers

Bench Jeweler - Mayfaire Town Center Job at REEDS Jewelers

 ...REEDS Jewelers Bench Jeweler (Full-Time) Location: Mayfaire Town Center, Wilmington, NC REEDS Jewelers is seeking a skilled Bench Jeweler to join our team at our beautifully remodeled flagship store in Mayfaire Town Center a true luxury destination in one of... 

Hawthorne Lane

International Trade Legal Administrative Assistant/Paralegal Job at Hawthorne Lane

As the International Trade Legal Administrative Assistant/Paralegal to a busy practice group, you will support client development initiatives and administrative-based tasks in a dynamic DC law firm. Working closely with the Practice Group Manager, you will prepare filings... 

ForeScout

Intern - Software Engineering Job at ForeScout

 ...DepartmentOn-Prem EngineeringRoleIntern Software EngineeringOverviewForescout is one of the most impactful cybersecurity companies...  ...compatibility and stability throughout the system.~Develop automation workflows to detect vulnerabilities, apply patches... 

Chicago Rehab Partners

Residential Rehab Projects Job at Chicago Rehab Partners

 ...Electricians Plumbers HVAC Technicians Drywall & Taping Crews Flooring Installers Painters Roofing & Siding Crews Finish...  ...repeat work immediately . Job Types: Full-time, Part-time, Contract Pay: $20.00 - $30.00 per hour Work Location: In person...