Apache Spark, Apache Superset, Apache Flink together with Python coding
Full Time
A job description involving Apache Spark, Apache Superset, Apache Flink, and Python typically entails designing, developing, and maintaining data pipelines and analytical solutions using these technologies. The role involves working with large datasets, building and optimizing data processing workflows, and creating interactive dashboards for data visualization. Proficiency in Python programming and familiarity with the respective libraries and frameworks are essential.
Here’s a more detailed breakdown of the key aspects of such a role:
Core Responsibilities:
Data Pipeline Development:
Design, build, and optimize data pipelines using Apache Spark for large-scale data processing and transformation.
Real-time and Batch Processing:
Implement both real-time and batch processing solutions using Apache Flink and Spark, respectively.
Data Visualization:
Develop interactive dashboards and visualizations using Apache Superset to enable data exploration and analysis.
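Dashboards are largely built in Superset's UI, but the role also touches its REST API. A hedged sketch of authenticating and listing dashboards programmatically (the host URL is a placeholder; the endpoints follow Superset's documented API):

```python
import requests

BASE_URL = "http://superset.example.com"  # placeholder Superset host


def superset_session(username: str, password: str) -> requests.Session:
    """Authenticate against Superset's REST API and return a session
    carrying the bearer token."""
    s = requests.Session()
    resp = s.post(
        f"{BASE_URL}/api/v1/security/login",
        json={"username": username, "password": password,
              "provider": "db", "refresh": True},
    )
    resp.raise_for_status()
    s.headers["Authorization"] = f"Bearer {resp.json()['access_token']}"
    return s


def list_dashboards(s: requests.Session) -> list:
    """Return the titles of dashboards visible to the authenticated user."""
    resp = s.get(f"{BASE_URL}/api/v1/dashboard/")
    resp.raise_for_status()
    return [d["dashboard_title"] for d in resp.json()["result"]]
```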
Python Programming:
Utilize Python for scripting, data manipulation, and interacting with Spark, Flink, and Superset APIs.
Performance Optimization:
Optimize data processing workflows for speed and efficiency using techniques like caching, partitioning, and query optimization.
Collaboration:
Work closely with data scientists, data engineers, and other stakeholders to understand requirements and deliver impactful data-driven solutions.
Troubleshooting and Maintenance:
Identify and resolve issues within data pipelines and applications, ensuring their stability and reliability.
Cloud Environments:
Leverage cloud platforms like AWS, Azure, or GCP for deploying and managing Spark, Flink, and Superset clusters.
Required Skills and Experience:
Strong Python programming skills: Experience with Python libraries such as Pandas and NumPy, and others relevant to data manipulation and analysis.
Proficiency in Apache Spark: Understanding of Spark core concepts, RDDs, DataFrames, and Spark SQL.
Experience with Apache Flink: Familiarity with Flink’s streaming capabilities, state management, and windowing mechanisms.
Knowledge of Apache Superset: Experience in designing and building interactive dashboards and visualizations.
Data Modeling and ETL: Experience with designing data models and implementing ETL (Extract, Transform, Load) processes.
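An ETL process of the kind listed above can be sketched in plain Python with the standard library (the CSV data and table schema are invented for illustration; production systems would use Spark or a warehouse instead of in-memory SQLite):

```python
import csv
import io
import sqlite3

# Extract: raw CSV input (stand-in for a file or API export).
raw_csv = io.StringIO("user,amount\nalice,10\nbob,5\nalice,7\n")

# Transform: parse rows and total the amount per user.
totals = {}
for row in csv.DictReader(raw_csv):
    totals[row["user"]] = totals.get(row["user"], 0) + int(row["amount"])

# Load: write the aggregates into a target table (in-memory SQLite here).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE user_totals (user TEXT PRIMARY KEY, total INTEGER)")
conn.executemany("INSERT INTO user_totals VALUES (?, ?)", totals.items())
conn.commit()

loaded = dict(conn.execute("SELECT user, total FROM user_totals ORDER BY user"))
```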
9+ years of experience. US timings. Remote position. www.staginvs.com