Senior Data Scientist #3862
Senior Data Scientist, Warren, MI, General Motors. Design, develop, & create data model pipeline consisting of data from heterogeneous data sources including customer text, warranty & electric vehicle (EV) datasets, using R, Python & SQL programming languages, pySpark API, Hive data warehouse, & multi cluster Hadoop environment. Perform data discovery, data mining, predictive modeling, simulation, & statistical analysis of data by observing the distribution of random variables & features in EV & diagnostics datasets, building & using machine learning algorithms (models) such as Linear/Logistic Regression, Support Vector Machines, Random Forest, Gradient Boosting & Deep Learning. Process geospatial files of new car dealership & repair centers location data & combine with connected vehicle data including diagnostics data, location data, DTCs data, & EV charging data sourced through OnStar VCIM. Design & develop analytic web applications & dashboard to show output of predictive algorithms using PowerBI leau. Bachelor, Computer Science, Electrical or Computer Engineering, Math, or related. 60 months experience as Data Scientist, Architect, Analyst, Programmer Analyst, or related, developing or creating data pipeline, consisting of text data, or lifecyle data, or warranty data, using Python &SQL programming languages, & Hive data warehouse, or related.