Infotel UK Consulting Logo

Infotel UK Consulting

Data Engineer - Python

Posted 6 Days Ago
Be an Early Applicant
In-Office
Bengaluru, Bengaluru Urban, Karnataka, IND
Mid level
In-Office
Bengaluru, Bengaluru Urban, Karnataka, IND
Mid level
Design, build, test, and deploy ETL/ELT data pipelines and data‑warehouse solutions for AML/compliance domains. Produce technical specifications, coordinate stakeholders, automate data ingestion controls/tests, implement DevOps practices, and support releases while ensuring data quality, lineage, and regulatory requirements are met.
The summary above was generated by AI

The candidate should also have good knowledge of ETL design and be able to develop optimized code. Working experience on PySpark and Scala programming is an added advantage.


Responsibilities

·         Conduct detailed validation of functional specifications (eventually contribute to functional specifications if needed).

·         Initiate, build, and contribute to technical specifications.

·         Perform technical and/or data analysis to elaborate technical Specifications documents for the different IT stakeholders (IT 2S data provider, CIB datahub, AML dev teams)

·         Assist the technical stakeholders to validate the solutions and validate the technical tests planned.

·         Coordinate with the different stakeholders to implement the changes/evolutions in the delay, cost and quality expected.

·         Control the completion of the technical tests and associated deliverables.

·         Support and contribute to the releases organization.

·         Ensure data ingestion controls and technical tests automation developments are done according to expectations

  • Implement DevOps practices to ensure efficient and reliable deployment of data pipelines and ETL processes,

Direct Responsibilities

  • Understand business requirement from business analysts, users and should have analytical mind to understand existing process and purpose better solutions
  • Work on TSD designs, development, testing, deployment, support
  • Suggest and implement innovative approach.
  • Should be adaptable to new technology or methodology

Contributing Responsibilities

  • Contribute towards knowledge sharing initiatives with other team members
  • Contribute documentation of solutions and configurations of the models
  •  

Technical & Behavioral Competencies

Mandatory

    • 3+ years of experience in Corporate and Institutional Banking IT, with a full understanding of the Corporate Banking and/or Securities Services activity.
    • Good understanding of AML monitoring tools and data needed for AML detection models.
    • Good understanding of Data Analysis and Data Mapping processes.
    • Extensive experience in working with functional and technical teams, defining requirements (mainly technical specification), establishing technical strategies, and leading the full life cycle delivery of projects.
    • Experience in Data-Warehouse architectural design providing efficient solutions in Compliance AML data domains.
    • Good Experience in Python developments, Oralce PL/SQL development
    • Excellent communication skills with the ability to explain complex technical issues in a simple concise manner.
    • Strong coordination and organizational skills.
    • Multi-tasking capabilities

All these qualifications are a plus:

    • Knowledge of Corporate Banking and Securities Services transactional data sources, flowing through the Compliance and Regulatory frameworks is a plus.  
    • Knowledge of Swift message and/or MX message formats and relevance to AML monitoring 
    • Experienced in implementing various data lineage mechanisms to meet regulatory requirements.

Success in the role is heavily dependent on the ability to show leadership, proactivity, and work cooperatively with both functional and technical teams, onshore and offshore

Specific Qualifications:

Python, Oracle,

Skills Referential (Required knowledge, skills and abilities)

Technical Skills:

  • Programming & Scripting (Primary)
    • Advanced Python (3.x) – OOP, typing, async, performance profiling
    • Familiarity with Python data‑engineer libraries: pandas, pyarrow, sqlalchemy, cx_Oracle, oracledb,polars, duckdb
    • Shell scripting (bash, PowerShell) for automation and orchestration
  • Oracle Database Expertise (Primary)
    • Oracle Database (11g/12c/19c/21c) administration basics
    • SQL proficiency: complex queries, analytic functions, hierarchical queries, PL/SQL development
    • Data modeling (ER, dimensional) and schema design for OLTP & OLAP
    • Performance tuning: indexing, partitioning, optimizer hints, AWR/ASH analysis
    • Oracle Data Pump, SQL*Loader, External Tables
  • Data Integration & ETL (Primary)
    • Design and implementation of ETL/ELT pipelines in Python (e.g., polars,pandas, pySpark, dbt)
    • Knowledge of messaging/streaming (Kafka, RabbitMQ) for real‑time ingestion
    • Data orchestration platforms: Apache Airflow, Prefect, or Azure Data Factory
  • Big Data & Distributed Processing (Secondary)
    • Working knowledge of Apache Spark (PySpark) and its integration with Oracle
    • Experience with cloud‑based big‑data services (AWS EMR, Azure Synapse, GCP Dataproc)
  • Cloud & DevOps (Secondary)
    • Oracle Cloud Infrastructure (OCI) services: Autonomous DB, Object Storage, Functions
    • Containerization (Docker) and orchestration (Kubernetes) for scalable pipelines
    • CI/CD pipelines (Git, Jenkins, GitHub Actions) for automated testing and deployment
    • Infrastructure‑as‑Code tools (Terraform, OCI Resource Manager)
  • Data Quality & Governance (Primary)
    • Implementing data validation, profiling, and cleansing in Python
    • Familiarity with data lineage, metadata management, and catalog tools (Apache Atlas, Collibra)
    • Understanding of GDPR, CCPA, and other data‑privacy regulations
  • Testing & Monitoring (Primary)
    • Unit & integration testing frameworks (pytest, unittest) for data pipelines
    • Monitoring & alerting (Prometheus, Grafana, OCI Monitoring) of ETL jobs and database health
  • Version Control & Collaboration (Secondary)
    • Proficient with Git (branching, pull‑requests, code reviews)
    • Agile methodologies (Scrum/Kanban) and ticketing systems (Jira, Azure Boards)

Behavioral Skills:

    • Ability to collaborate / Teamwork
    • Communication skills - oral & written
    • Creativity & Innovation / Problem solving
    • Ability to share / pass on knowledge

Education Level: Bachelor Degree or equivalent

Location: Bangalore


Similar Jobs

3 Days Ago
In-Office
Bangalore, Bengaluru Urban, Karnataka, IND
Senior level
Senior level
Healthtech
Design, build and operate production ETL/ELT data pipelines in Python and FastAPI, maintain Spark/Scala workloads, ensure data quality, lineage and observability, collaborate with product and AI teams to deliver AI-ready data products, and apply testing, CI/CD and schema management.
Top Skills: AirflowSparkArgocdAWSCi/CdClaude CodeDbtEtl/EltFastapiKafkaPysparkPythonScalaSQLTerraform
10 Days Ago
In-Office
Bangalore, Bengaluru Urban, Karnataka, IND
Mid level
Mid level
Artificial Intelligence • HR Tech • Professional Services • Software
Develop and maintain Python- and SQL-based analytics, reports, dashboards, and RESTful APIs. Collaborate with Product Management and Professional Services to gather requirements, estimate projects, ensure data accuracy, optimize queries, and deliver customer-facing analytics solutions. Master data models and data warehouse architectures and generalize custom reports where possible.
Top Skills: ClickhouseData Visualization LibrariesGitNumpyPandasPower BIPythonRestful ApisSnowflakeSQLTableau
10 Days Ago
In-Office
Bengaluru, Bengaluru Urban, Karnataka, IND
Senior level
Senior level
Big Data • Cloud • Digital Media • Machine Learning • Mobile • Software • Industrial
As a Senior Data Engineer, you'll design and improve data pipelines, ensure data quality, and collaborate with stakeholders to maximize data usage for analytics and operational needs.
Top Skills: AirflowApache ZeppelinAWSAzureEmr NotebooksFlinkGCPGitJenkins CiJupyterLookerPower BIPythonSnowflakeSparkSQL

What you need to know about the Bengaluru Tech Scene

Dubbed the "Silicon Valley of India," Bengaluru has emerged as the nation's leading hub for information technology and a go-to destination for startups. Home to tech giants like ISRO, Infosys, Wipro and HAL, the city attracts and cultivates a rich pool of tech talent, supported by numerous educational and research institutions including the Indian Institute of Science, Bangalore Institute of Technology, and the International Institute of Information Technology.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account