The company is hiring a Senior Data Scientist with extensive experience applying machine learning to solve business problems.
- Strong working knowledge of software development tools, techniques, and approaches used to create application solutions
- Implements analytical algorithms for object detection, segmentation, classification and recognition
- Is responsible for uncovering information and identifying business opportunities using algorithmic, statistical and data mining techniques
- Plays a strategic role in generating creative ideas that turn the company’s vast collection of data into breakthrough new IT solutions
- Establishes accurate and scalable analysis systems
- Interprets the results of statistical and predictive experiments and regression analyses and integrates them into complex business processes
- Performs customer specification studies, gathers requirements, performs system architectural design and transforms requirements into a final product
- Provides comprehensive development, deployment, and application lifecycle operations support for big data solutions and infrastructure
- Collaborates with various team members and facilitates the development, automation and seamless delivery of analytics solutions in Big Data clusters
- Is responsible for data warehousing and ETL using Informatica PowerCenter
- Analyzes data and creates reports using data visualization tools such as Tableau, Cognos, MicroStrategy
- Uses Sqoop to import and export data between HDFS and relational database systems
- Codes and tests standardization, normalization, load, extract and Avro models to filter, massage and validate data
- Installs, configures and uses ecosystem components such as Hadoop MapReduce, Spark, Hive, Sqoop, Pig, HDFS, HBase, Cassandra, ZooKeeper, Oozie, Hue, Impala and Flume
Minimum requirements
- 10+ years of deep experience in the data science industry in a fast-paced and complex business environment and with top-tier teams
- Extensive experience applying machine learning to solve business problems
- Agile work experience
Tech stack:
- Language: Python, Scala, SQL, Java, PL/SQL
- Web technologies: Web services, SOAP, REST web services, JSP
- Big Data ecosystem: Spark, HDFS, YARN, MapReduce, Hive, Pig, Sqoop, ZooKeeper, Kafka, Oozie, Hue, Impala, Flume
- Scripting language: HTML, JavaScript, CSS, XML and Ajax
- Machine learning: R, SAS, scikit-learn, MATLAB, Octave, Spark ML
- NoSQL databases: Cassandra, HBase, MongoDB, Vertica
- Cloud: AWS, Azure Cloud
- Operating system: Windows, Linux and Unix
- BI Tools: Informatica 9.5/9.1/8.6, Tableau, Cognos
- DBMS / RDBMS: Oracle 12c/11g, SQL Server 2014, DB2, Teradata 14/12, AWS Redshift
- IDE: Eclipse, Jupyter Notebooks, Microsoft Visual Studio, Flex Builder, Spyder, TOAD, NetBeans, PL/SQL Developer, PuTTY, SQuirreL SQL
- Version control: SVN, CVS, Git and Rational ClearCase
- Tools: FileZilla, JUnit, Splunk, ClearQuest, Rally, Jira, Confluence, Bitbucket