Customer-focused, Self-driven, and Motivated with a strong work ethic and a passion for problem-solving.
5+ years of designing, implementing, tuning, and managing services in a distributed, enterprise-scale on-premise, and public/private cloud environment.
Familiarity with infrastructure management and operations lifecycle concepts and ecosystem.
Hadoop cluster design, Implementation, management, and performance tuning experience with HDFS, YARN,
HIVE/IMPALA, SPARK, Kerberos and related Hadoop technologies are a must.
Must have strong SQL/HQL query troubleshooting and tuning skills on Hive/HBase.
Must have a strong capacity planning experience for Hadoop ecosystems/data lakes.
Good to have hands-on experience with – KAFKA, RANGER/SENTRY, NiFi, Ambari, Cloudera Manager, and HBASE.
Good to have data modeling, data engineering, and data security experience within the Hadoop ecosystem. Good to have deep JVM/Java debugging and tuning skills.
Certification on any of the leading Cloud providers (AWS, Azure, GCP ) and/or Kubernetes is a big plus.