QUALIFICATIONS AND EDUCATION REQUIREMENTS
Bachelors Degree in a computer-related field (Computer Science, Software Engineering, Information Systems) or equivalent experience required. Masters degree preferred.
WORK EXPERIENCE REQUIREMENTS
9+ years of Information Technology experience with track record of multiple large Business Intelligence (BI), Data Warehouse (DW) or Big Data Analytics project completions.
Deep ETL expertise.
Excellent design skills, including ability to translate complex data flows/transformations into sequence of steps that can be implemented using ETL tools.
Demonstrated ability to develop ETL systems that are efficient, reliable, recoverable, well-documented, auditable, parameter-driven, and maintainable.
Knowledge with a variety of ETL,data quality, data cleansing, and data blending tools.Alteryx highly preferred. Familiarity with AWS Data Pipeline a plus.
Experience with data profiling, metadata management, address cleansing, high availability, server tuning concepts (parameters, resources, contention) and parallelism into ETL flows is preferred.
Proven ability in managing databases up to 200TB in size.
Able to write & tune SQL queries (MySQL).
Full understanding of MPP columnar (Redshift),NoSQL (MongoDB), graph (neo4j) database architecture. Knowledge of one or more OLAP (e.g. Mondrian), reporting (e.g. Jasper Reports), visualization tools (e.g. Tableau).
Comfortable in both Windows and Linux environments.
Understanding and ability to participate in all phases of the SDLC including requirements gathering, business analysis, configuration management and quality control. Working knowledge of a project management and bug tracking tool.Familiarity with Entity-Relationship (ER) / UML modeling and diagramming tools (like ERwin or ER/Studio).
Excellent analytical and communication skills, including ability to work effectively with business users and cross-functional technical teams. Attention to details.
Good interpersonal, verbal and written communication skills. Excellent documentation skills.
Can-do attitude - startup experience preferred.
Comfortable working in a fast paced environment.
Ability to multi-task and work on multiple projects while under pressure.
Experience in a high technology start-up in a B2B context.
Experience with Cloud IT, for example AWS EC2 instance / RedShift or EMR cluster monitoring, optimization and backup / restore / disaster recovery.
Working knowledge of a programming or statistical language like Java or R.
Familiaritywith the following: Hadoop ecosystem (AWS EMR / Cloudera, Hive, Spark, Flume, Giraph); Natural Language Processing (NLP), Named Entity Recognition (NER), part of speech (POS) tagging; Statistics, Machine Learning(Mahout, R, Weka, RapidMiner, KNIME); Search (IR Lucene / Solr, ElasticSearch, CloudSearch); and/or social network analysis.