Data engineer (6 Month contract)

Downtown Core, S00, SG, Singapore

Job Description

Position Overview




Job Title: Data Engineer (6-Month Contract)




Department: Services




Location: Singapore




Reporting To: Contract




Duration: 6 months







Tookitaki is seeking a Data Engineer (Contract) with strong expertise in Apache Spark and Cloudera (CDP) to support high-priority data initiatives for our AI-driven financial crime prevention platforms--FinCense and the AFC Ecosystem. This role will contribute to building and maintaining robust data pipelines that ensure accurate, scalable, and production-grade data processing across real-time and batch workflows.




Position Purpose





This role is designed to support data engineering efforts during a critical delivery phase. The engineer will work closely with platform, product, and services teams to enable high quality data ingestion, transformation, and availability across Tookitaki's compliance modules. The work done in this role directly contributes to risk scoring, transaction monitoring, and fraud detection systems for global banks and fintech clients.




Key Responsibilities





1. Spark-Based Data Development


Design and optimize batch and streaming pipelines using Apache Spark. Debug performance and memory issues in Spark-based ETL processes.

2. Cloudera Data Platform (CDP) Handling


Leverage HDFS, Hive, Impala/Trino, and HBase within Cloudera to support data workflows. Collaborate with infra teams to ensure CDP cluster reliability and schema alignment.

3. Pipeline Development & Monitoring


Build ingestion pipelines using Kafka, Hive, Spark for large-scale financial datasets. Support Airflow-based orchestration and ensure production SLAs are met.

4. Data Validation & Debugging


Write and optimize SQL queries to validate data accuracy and ingestion success. Assist in tracing pipeline issues and executing backfills if necessary.

5. Cross-Functional Collaboration


Coordinate with data scientists, DevOps, and service teams to support platform releases. Deliver on strict project timelines tied to active client deployments.



Qualifications and Skills




Education




Bachelor's/Master's in Computer Science, Engineering, or related discipline.

Experience




5-8 years as a Data Engineer, with at least 2 years in Spark-heavy environments. Prior experience working with Cloudera Data Platform (CDP) in production.

Technical Expertise




Apache Spark (Core, SQL, Tuning) Cloudera CDP: Hive, HDFS, HBase, Impala/Trino Kafka, Airflow, SQL Python and Bash scripting Familiarity with Linux-based environments Exposure to AWS is a plus

Soft Skills




Strong problem-solving mindset Ability to thrive in contractual, delivery-driven settings Clear communication and documentation habits Focus on execution, quality, and speed



Key Competencies




Data Pipeline Ownership Big Data Architecture Execution Agility in Project Timelines Collaborative Implementation Mindset Operational Readiness Success Metrics On-time delivery of assigned pipeline components Stability and performance of Spark workflows in UAT and production Accuracy of data validation and transformation logic * Cross-team satisfaction with deliverables in rollout sprints

Beware of fraud agents! do not pay money to get a job

MNCJobz.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.


Job Detail

  • Job Id
    JD1702149
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type:
    Full Time
  • Salary:
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    Downtown Core, S00, SG, Singapore
  • Education
    Not mentioned