Requirements:
- 3+ years of experience as a Data Scientist, preferably in Big Data Environment
- 2+ years of programming experience in Java/Scala and/or Python
- Hadoop stack (HIVE, Pig, Hadoop streaming) and MapReduce
- HBase or comparable NoSQL
- SQL & database experience
- Experience with Google products: Google Cloud Storage, Google Analytics and Google Big Query (a plus)
- Bachelor’s degree in quantitative or related field
Responsibilities:
- Design and build predictive customer behavior models for targeting and personalization
- Implement Machine Learning and statistics-based algorithms for prediction and optimization, then deliver to production
- Build and maintain code to populate HDFS, Hadoop with log from Kafka or data loaded from SQL production systems
- Design, build and support algorithms of data transformation, conversion, computation on Hadoop, Spark and other distributed Big Data Systems