Data Engineer Lead/Architect ( Hadoop )
Long Term Contract
Role and Responsibilities
Design, build, and implement scalable streaming data pipelines and ETL frameworks to increase data access and decrease analysis and decision times across the organization
Own software throughout the entire development life-cycle design, code, test, automate & deploy
Share ideas to improve our product and processes, and provide feedback
7+ years of experience in defining data architecture solutions and establishing common data capabilities for enterprises
Proven experience in creating actionable Data and Analytics strategies for Compliance, Risk, Financial Intelligence business functions
Experience in defining technology blueprints, roadmaps and collaboratively defining solutions and enabling architecture capabilities
Experience in tool selection, conducting rapid PoCs and recommending use case appropriate technologies
3+ years experience building distributed solutions in Spark, Map Reduce and other MPP system with associated data models and datastores (e.g., Redshift, Cassandra, HBase, Parquet)
2+ years of experience working with AWS Cloud data engineering stack including EC2, S3, EMR, Kinesis, Glue and other AWS Services
Hands-on experience with Apache Ni-Fi, Kafka, Python, Spark preferably on AWS
Experience with structured/unstructured/semi-structured data ingestion and processing
Experience with automation and deployment (Jenkins, Cloud Formation, Chef etc.)
Experience writing high quality code in Python and one another OOP language (Java, Scala, C++, Go, etc)
Experience working with RDBM systems, particularly familiarity with SQL
Solid Experience in optimizing the Hive queries using Partitioning and Bucketing techniques, which controls the data distribution, to enhance performance.
Experience in working with UNIX shell scripts.
Production development of event-based applications using frameworks such as Kinesis, Kafka, Spark Streaming, or similar
Familiarity with machine learning techniques, continuous deployment pipelines and tools.
Desire to work across internal teams to identify requirements and iterate on solutions
Debug complex production issues across various levels of the tech stack
Thanks & Regards,
James Smith Sr. Technical Recruiter
Kaizen Technologies Inc.
Direct: Ext : 1215
Spark, MapReduce, AWS Cloud, Apache, Kafka, Python, Jenkins, RDBMS, UNIX
Long Term Contract