Proficient in one or more of modern programming language such as Java, Python. Proficiency in Analytics Packages like R, SAS, Matlab.
Experience and ability to work in a Unix/Linux environment, and proficient in command-line scripting
Ability to implement, maintain, and troubleshoot big data infrastructure, such as distributed processing paradigms, stream processing(Storm,spark), search api(Solr) and databases, such as Hadoop,HBASE,HIVE,SQL etc.
Strong mathematical background with ability to understand algorithms and methods from a mathematical viewpoint and an intuitive viewpoint
Strong data extraction and processing, using MapReduce, Pig, and/or Hive preferred