The Data Engineer will design and optimize scalable data pipelines, manage data lakehouse environments, and ensure data quality and governance.
Job Title: Data Engineer (3-6 Years of Experience)
Location: [Bangalore/Hyderabad]
Job Type: Full-time
About the Role:
We are looking for a skilled Data Engineer with 3-6 years of experience in big data technologies, particularly Java, Apache Spark, SQL and data lakehouse architectures. The ideal candidate will have a strong background in building scalable data pipelines and experience with modern data storage formats, including Apache Iceberg. You will work closely with cross-functional teams to design and implement efficient data solutions in a cloud-based environment.
Key Responsibilities:
- Data Pipeline Development:
- Design, build, and optimize scalable data pipelines using Apache Spark.
- Implement and manage large-scale data processing solutions across data lakehouses.
- Data Lakehouse Management:
- Work with modern data lakehouse platforms (e.g.Apache Iceberg) to handle large datasets.
- Optimize data storage, partitioning, and versioning to ensure efficient access and querying.
- SQL & Data Management:
- Write complex SQL queries to extract, manipulate, and transform data.
- Develop performance-optimized queries for analytical and reporting purposes.
- Data Integration:
- Integrate various structured and unstructured data sources into the lakehouse environment.
- Work with stakeholders to define data needs and ensure data is available for downstream consumption.
- Data Governance and Quality:
- Implement data quality checks and ensure the reliability and accuracy of data.
- Contribute to metadata management and data cataloging efforts.
- Performance Tuning:
- Monitor and optimize the performance of Spark jobs, SQL queries, and overall data infrastructure.
- Work with cloud infrastructure teams to optimize costs and scale as needed.
Qualifications:
- Bachelor’s or Master’s degree in Computer Science, Information Technology, or a related field.
- 3-6 years of experience in data engineering, with a focus on Java, Spark and SQL Programming languages.
- Hands-on experience with Apache Iceberg, Snowflake, or similar technologies.
- Strong understanding of data lakehouse architectures and data warehousing principles.
- Proficiency in AWS data services.
- Experience with version control systems like Git and CI/CD pipelines.
- Excellent problem-solving and analytical skills.
- Strong communication and collaboration skills.
Nice to Have:
- Experience with containerization (Docker, Kubernetes) and orchestration tools like Airflow.
- Certifications in AWS cloud technologies
Top Skills
Airflow
Apache Iceberg
Spark
AWS
Docker
Git
Java
Kubernetes
Snowflake
SQL
Sigmoid Bengaluru, Karnataka, IND Office
Bengaluru, Karnataka , India, 560037
Similar Jobs
Financial Services
As a Software Engineer III, develop scalable coding frameworks, produce quality code, optimize software applications, and influence product design within an agile team.
Top Skills:
AWSCockroachdbDatabricksDockerDynamoDBJavaJenkinsKubernetesPythonSnowflakeSparkSpringSpring BootSQL
Cloud • Information Technology • Security • Software
The Senior Platform Software Engineer will lead the development of JumpCloud's open directory platform, focusing on identity, device, and access management solutions.
Top Skills:
AndroidAppleLinuxWindows
Cloud • Information Technology • Security • Software
The Platform Software Engineer is responsible for building a unified open directory platform to manage identities and devices securely.
Top Skills:
AndroidAppleLinuxWindows
What you need to know about the Bengaluru Tech Scene
Dubbed the "Silicon Valley of India," Bengaluru has emerged as the nation's leading hub for information technology and a go-to destination for startups. Home to tech giants like ISRO, Infosys, Wipro and HAL, the city attracts and cultivates a rich pool of tech talent, supported by numerous educational and research institutions including the Indian Institute of Science, Bangalore Institute of Technology, and the International Institute of Information Technology.