Duration: Long term
Ultimate is managing group building out a data warehouse
Creating an enterprise DW on Hadoop stack. We’re using Hortonworks but that moved into cloudera. They are moving into Hadoop and eventually into cloud.
Focusing on hive & spark
Data lake team, will also be working on the cluster
Will be bringing HDFS data from the lake over with raw events and populating from there into the warehouse and on the front end using a tool called looker for the BI piece
Project will be End to end from data lake to BI front end.
Someone who has 3-5 year experience on Hadoop, focused primarily on data warehouse, Kafka would be good too.
Joined a governance group, using ranger and atlas for metadata
Hasn’t interviewed anybody so far, has 2 resources including himself.
1. Skillset with Hadoop, Hive and data warehousing HDFS and Kafka as well.
2. Data warehousing exp, data modeling how to handle changing dimensions, diff types, basically understanding of what you do from a warehouse perspective and
3. Someone with overall IT experience, people who have experience within the industry for 10 + years senior level person
Communication skills are very important, still need to fit into the Ultimate Software culture and especially because they want to convert
SQL important because hive is built on SQL
They use ER Studio for data modeling, but not required to know that tool, anybody that has general data modeling experience can pick it up in a few hours.