Data Analyst – R&D Responsibilities:
▪ Architect a data-based solution for the business problem presented.
▪ Collaborate with different MSIL Departments for their data requests & build solutions that automate their daily tasks thereby saving man hours.
▪ Understand the Time-Series Telematics data generated from the vehicle.
▪ Clean up and Prepare datasets for modeling, get involved in ETL process if required.
▪ Build KPIs/metrics by applying data transformation techniques such as aggregation, resampling, filtering etc.
▪ Exploratory data and get insights. Present the descriptive stats and insights to the domain experts. Find meaningful patterns in data, detect seasonality and trend, establish cause and effect relationships in data. Develop & Test hypothesis in collaboration with the domain experts.
▪ Design features, shortlist features, study feature importance, decide the ML strategy. ▪ Build data pipelines for data extraction, cleaning, transforming, feature extraction, and machine learning.
▪ Data modelling, selection of an appropriate machine learning / deep learning model, data pipeline setup for model training, hyper-parameter tuning, validation and test. Apply ensemble model techniques (if required).
▪ Reporting & Visualization: Comprehension of reports, visualization of data in the form of plots, generate reports using BI tools, develop live updating dashboards.
Technical Skills / Experience:
Essential:
▪ Must have a hands-on experience with Python. Worked on libraries- Numpy, Pandas, Matplotlib.
▪ Experience building and training machine learning models for classification, regression and clustering (e.g. Generalized Linear Models, Boosting, Decision Trees, Neural Networks, SVM, Bayesian Methods, time series models, KMeans, Hierarchical clustering etc.)
▪ Knowledge about summarizing data, generating graphs, charts and reports. Desirable:
▪ Experience with BI tools – Power BI, IBM Cognos etc.
▪ Experience building RESTful APIs.
▪ Experience with back-end/front-end development.
▪ Working experience with cloud computing platforms such as AWS / IBM.
▪ Ability to write scalable SQL queries.
▪ Experience in Spark or other distributed computing frameworks.
▪ Experience in time-series/IoT data analytics.
▪ Exposure to automotive systems, automobile basics, Controller Area Network protocol (CAN protocol) etc.
Behavioral
▪ Excellent interpersonal skills.
▪ Creativity and ability to bring in innovative ideas for Kaizen and solving everyday problem.
Educational Requirement:
BE/ B Tech with 60% marks and certification course/diploma in Data science.