Hi, I'm Simaant

I'm a data enthusiast, a music lover, a foodie and a travel buff

About me

I have a Master's degree in Information Management with a specialization in Data Science from Syracuse University, New York. I enjoy working with data and love to generate visualizations, predictive models in order to understand what the data is trying to say.

I believe that data is the most valuable resource in today's age and if harnessed and used effectively, it can transform human lives in a positive way.

Also, I've developed my skills in the domain of Data Analytics, Data Visualization, Machine Learning by using Python, R Programming, SQL, Tableau and Power BI. I'm currently working as a Data Scientist at an organization called Ascend innovations

Download Resume
Python
R Programming
SQL
Tableau
Excel
Hi, This is me, Simaant

Work Experience

Data Scientist

Ascend Innovations - Dayton, OH
December 2020 - Present

Utilizing healthcare data from SQL databases, and incorporating Python and R for data analysis and developing machine learning algorithmns for providing insights and suggestions to healthcare organizations about general public health in the area

Business Data Analyst Intern

City of Syracuse / iConsult - Syracuse, NY
February 2020 - May 2020

Focused on utilizing ETL pipeline for data generated in Syracuse city for data cleaning and effective management. Created Power BI dashboards to determine trends for understanding the cause of violations and complaints in the city

Graduate Research Assistant

Syracuse University - Syracuse, NY
February 2020 - May 2020

Compiled 1M domain of websites for web scraping for extracting medical domains from those and implemented topic modeling for determining most important medical terms

Data Analyst

iConsult - Syracuse, NY
February 2019 - May 2020

Collaborated with multiple teams to understand patient data in Syracuse region. Implemented SSIS packages for data extraction and storage in database and created Tableau visualizations to determine the diseases prevalent in Syracuse

Education

Syracuse University

Master's in Information Management
August 2018 - May 2020

Relevant Coursework: Data Science, Big Data, Database Management, Neural Networks

University of Mumbai

B.Tech in Electronics and Telecommunication
August 2014 - June 2018

Relevant Coursework: Data Structures, Statistics, Data Analysis, Economics & Management

Certifications

IBM Data Science Capstone

Data Science Capstone Project by IBM and Coursera
Link to Certificate


Machine Learning

Machine Learning by Coursera and Andrew Ng
Link to Certificate

Projects

Here are some of the projects that I've worked on

(Click on the images for detailed information)

×

Patient Readmittance Analysis

The goal of the project was to determine the chances of patient being readmitted to a hospital. The different features included gender, weight, race, admission type and also some patients had some underlying conditions for diabetes and other illness. The patient data was collected from 1991-2008 and teh dataset was obtained from UCI Machine Learning Repository

Skills: Python, PySpark, Numpy, Pandas, Seaborn, Regression, Random Forest, Jupyter Notebook, Tableau

Project Link

IBM Data Science Project

The project aimed to determine the best neighbourhoods and boroughs to open a Chinese Restuarant in New York City.

Skills: Python, Numpy, Pandas, Seaborn, JSON, K-Means, Jupyter Notebook

Project Link Medium Article

Customer Satisfaction Ratings for Airlines

The goal of the project was to determine the satisfaction ratings between 1-5 (1 being lowest) of traveling with different airlines in the States. The different features that were utilized where age, gender, class of travel, airport, airlines, etc.

Skills: R, R Studio, ggplot, Regression, Associative Rule Mining, SVM

Project Link

Quora Insincere Questions Classification

The objective of the project was to classify the questions posted the users on Quora as sincere or insincere and to understand the distribution and the reason behind the classification. The dataset was obtained from Kaggle.

Skills: Python, Numpy, Pandas, Seaborn, Logistic Regression, SVM, CNN, LSTM, Jupyter Notebook

Project Link

Netflix Ratings Distribution

The goal of the project was to understand the distribution of ratings of 1000 shows and movies on Netflix till 2017 and to visualy represent the trends

Skills: Tableau, Tableau API, MS Excel

Project Link Tableau Public Profile

Spotify Top 200 Daily Streams

The goal of the project was to create cvisualizations for the songs streamed on Spotify in the Top Daily Streams category, throughout

Skills: Tableau, MS Excel

Project Link Tableau Public Profile

Data Warehousing Project

This project aimed to develop a data warehouse for the merger of two corporations, the first one being a movie renting service and the other one being an online retailer. The second objective was to identify the number of lag days from when a particular order is placed and to genrate insights from the data obtained for improving the delivery service.

Skills: MS SQL Server, Visual Studio, SSIS, SSAS, SQL, MS Excel, MS Power BI

Project Link

Healthcare Database Management System

The goal of the project was to develop a database management system for a hospital for better management, storage and analysis of the employees and patient visiting the hospital.

Skills: MS SQL Server, SQL, MS Access

Project Link