Details:
- Gather requirements and understand data across structures and dimensions
- Knowledge of data cleaning, wrangling, visualization, and reporting, and of how to use the associated tools and applications efficiently to complete these tasks.
- Experience processing large amounts of structured and unstructured data, including integrating data from multiple sources.
- Collaborate with analytics and business teams to improve the data models that feed business intelligence tools, increasing data accessibility and fostering data-driven decision making across the organization.
- Implement processes and systems to monitor data quality, ensuring production data is always accurate and available for key stakeholders and business processes that depend on it.
- Perform the data analysis required to troubleshoot data-related issues and assist in resolving them.
- Develop, deliver, and deploy Python code, modules, and generic functions
- Good understanding of data mining, machine learning, natural language processing, or information retrieval.
- Partner with NLP, engineering, and business teams to implement production-ready code
Min. Qualifications:
- A bachelor’s degree in a STEM discipline
- Proficiency in Python is a must for data processing, statistical techniques, and exploratory data analysis (EDA)
- 4+ years of SQL experience (NoSQL experience is a plus). Experience in MapReduce is a plus.
- Good understanding of developing ETL logic and code
- Experience with schema design and dimensional data modeling
- Experience with machine learning toolkits, including H2O, SparkML, or Mahout
- Willingness to explore new alternatives to solve data mining issues, and to use a combination of industry best practices, data innovations, and your experience to get the job done.
- Ability to manage and communicate data warehouse plans to internal clients
- Experience designing, building, and maintaining data processing systems
- Familiarity with data processing and ML libraries such as pandas, NumPy, and scikit-learn is a must
- Familiarity with predictive models such as regression, classification, clustering, and forecasting
- Knowledge of technology infrastructure, specifically big data technologies such as Elasticsearch, Spark, and H2O
- Familiarity with platforms such as Google Cloud, AWS, and Azure
- Familiarity with Spark/PySpark
- Hands-on experience with BI solutions such as Tableau, Power BI, Qlik, ThoughtSpot, AnswerRocket, and Looker
Skills Required:
Python, Machine learning, SQL
Roles:
As part of the analytics team, you’ll be responsible for designing and developing the required data infrastructure and tools, including those for collecting, storing, processing, and analyzing our data. You know how to work quickly and accurately, using the best solutions to analyze massive data sets, and you know how to get results.