PREET MODI

Business Analyst
Data Scientist


View Resume

01

Know
About me

Current Data Science Graduate student actively looking for full-time new graduate opportunities to contribute and excel in roles spanning Data Scientist, Data Engineer, Data Analyst, Business Intelligence, and Software Engineer positions
Currently working as Research Assistant at IUB
Graduate Teaching Assistant for INFO-I 535 Management, Access, And Use of Big And Complex Data
IBM Certified Data Scientist


SKILLS

Languages Python | SQL | R | Java | C | C++ | JavaScript | C# | Linux | React JS

Database & Tools SQL Server, PostgreSQL, Hive, MongoDB, Tableau, PowerBI, Airflow, Kafka, SAP, SAS, Excel, VS Code, AWS, GCP, PySpark, Epicor, Databricks, Snowflake, Git, Azure, EC2, MATLAB

Data Science ETL, Predictive Modeling, Regression, Classification Trees, Time Series Analysis, Data Warehousing, Natural Language Processing, Hypothesis Testing, Artificial Intelligence, Statistical Analysis, Data Visualization

02

Education

MASTER OF SCIENCE IN DATA SCIENCE

INDIANA UNIVERSITY BLOOMINGTON

AUGUST 2022 - MAY 2024
GPA 3.84

BACHELOR OF TECHNOLOGY IN INFORMATION TECHNOLOGY

DHARMSINH DESAI UNIVERSITY

JULY 2018 - MAY 2022
GPA 4.0

03

My
Experience

  • August 2023 - Present

    Research Data Scientist

    Indiana University School Of Education

    Power BI | Python| Advanced Excel | Tableau | Data Visualization | Carnegie Classification
    Collaborating with Dr. Victor Borden, engaging in data metric analysis, and developing novel interactive visualizations for Carnegie Classification. This entails coding in both R and Python, as well as crafting Power BI widgets

  • August 2023 - Present

    Graduate Teaching Assistant - Big Data Management

    Indiana University Luddy School of Informatics, Computing, and Engineering

    Google Cloud Platform | NoSQL| Cloud Computing | Big Query | AWS | Distributed Computing
    INFO-I 535 Management, Access, And Use of Big And Complex Data- Crafting assignments, conducting fair grading, and providing active support to clarify doubts, ensuring students' comprehensive understanding and success in the subject

  • May 2023 - August 2023

    Business Analytics Intern

    Sacoma Specialty Products, LLC

    SQL | Epicor | Advanced Excel | SAP | Amazon Redshift | Amazon Quicksight | Power BI
    Integrated Epicor and SAP systems with AWS services, utilizing custom Business Activity Queries (BAQs), resulting in a 20% improvement in supply chain efficiency.
    Created a centralized data lake, streamlining data extraction, transformation, and loading (ETL) processes, reducing data processing time by 40% and enhancing data accessibility.

  • October 2022 - May 2023

    Business Analyst

    Indiana University Bloomington

    Power BI | Python| Advanced Excel | Data Analysis | Data Visualization
    Working with Residential Program and Services department to analyze data metrics regarding efficiency of various eateries and calculating cost per meal for students during different phases of the day cycle.
    Utilized DAX(Data Analysis Expressions) to create custom calculations and perform advanced analysis for insightful decision-making

  • December 2021 - April 2022

    Data Science Intern

    Institute for Plasma Research, Department of Atomic Energy (DAE), Government of India

    High Performance Computing | Python | SQL | Javascript | Data Visualization | IBM Watson Studio |Documentation | Node JS | Tableau
    Built a customized, real-time interactive HPC dashboard for administrators to monitor performance metrics of HPC system having impressive capabilities, including a processing power of 1 petaflop, 10,000 CPU cores and 44 GPU cards.
    Utilized a technology stack comprising Python with Flask for the backend, Node.js for the frontend, and incorporated Dash for visualization, enabling the development of a comprehensive application with seamless integration between the different components.

  • September 2021 - December 2021

    Software Engineer Intern

    JP Morgan Chase & Co.

    Apache Airflow | SAS | SQL | Kafka | Kubernetes | SDLC | Testing | Dashboard | MS Power Tools
    Implemented scalable data pipelines, ensuring data quality and integrity.
    Used SAS for statistical analysis, along with Power BI, SQL, and various MS Power tools, to perform data analysis, visualization, and reporting tasks.

04

Papers &
Projects

"An efficient Artificial Neural Network for Coronary Heart Disease Prediction", Volume 9, Issue XII, International Journal for Research in Applied Science and Engineering Technology (IJRASET) Page No: 1474-1483, ISSN: 2321-9653 (Impact Factor: 7.429)

DOI:https://doi.org/10.22214/ijraset.2021.39559

“Insurance Management with Premium Prediction ", Volume 9, Issue XII, International Journal for Research in Applied Science and Engineering Technology (IJRASET) Page No: 1222-1238, ISSN: 2321-9653 (Impact Factor: 7.429)

DOI:https://doi.org/10.22214/ijraset.2021.39416

PROJECTS

Topic Modeling on Customer Reviews - Yelp.com

  • This Yelp.com review analysis project, driven by Python and tools like LDA, Scikit-learn, and Power BI, aimed to enhance user experience and support businesses. It involved data collection (web scraping or Yelp’s API) using Selenium and text preprocessing (spaCy, NLTK, Gensim).
  • An ensemble machine learning model classified reviews, while Latent Dirichlet Allocation (LDA) uncovered topics, visualized with Matplotlib and Power BI. The results provided actionable recommendations for improving customer satisfaction, loyalty, and business performance.

Epicor-Driven Data Enlightenment: Transforming Business Intelligence

  • Implemented a data-driven dashboard to enhance operational efficiency and decision-making. Integrated data from diverse sources and conducted analyses to extract insights, refining data manipulation skills. Utilized Amazon Redshift for data pipelines and Excel Macros for streamlined processing.
  • Developed interactive dashboards using Power BI, Tableau & Quicksight, and in ERP systems like Epicor and SAP. Provided stakeholders with real-time visibility into key indicators, facilitating informed decisions. Wrote Business Activity Queries (BAQ) in Epicor and generated MRP dashboards, achieving a 40% increase in efficiency.

HPC Analytics Dashboard Application Development

  • Engineered an application for High Performance Computing (HPC) with a theoretical peak performance of 1 Petaflop (PF) and an infrastructure comprising over 10,000 CPU cores and 44 GPU cards.
  • Integrated Oracle RDMS for data storage and management. Leveraged Dash, Matplotlib, and Seaborn Python libraries and ReactJS for data visualization.
  • This project entailed implementing Software Development Life Cycle methods such as Agile and Scrum , utilizing Jira for project management. Harnessing problem-solving, communication abilities, and collaborative teamwork, the analysis delivered actionable business insights.

Web Determining the Causal Inference of a new pricing strategy on customer retention rates for an online subscription service (Netflix):

  • Conducted Predictive Analytics to predict customer churn and identify potential factors affecting customer retention.
  • The project involved data collection, preprocessing, and performing A/B testing, followed by statistical analysis using Stata to interpret the results and determine the magnitude of the effect of the pricing strategy on customer retention rates.

Exploratory Data Analysis for Bureau of Transportation Statistics Flight Performance:

  • Implemented a data pipeline, Developed a storage model in NoSQL server, Executed an algorithm using a parallel programming framework using Hadoop
  • Proposed a cleaning improvement solution, Explored a big data cloud platform environment and finally created an reliable data management plan. K-Means Clustering Algorithm was implemented.

Claim Severity Prediction using Computer Vision and Machine Learning:

  • Designed and implemented a state-of-the-art machine learning model utilizing Convolutional Neural Networks (CNNs) and a suite of Computer Vision libraries, including OpenCV, TensorFlow, and Detectron2, Meta AI’s platform for object detection and segmentation.
  • This model accurately predicted auto insurance claim severity based on images of damaged vehicles, achieving an outstanding 95% accuracy rate in distinguishing repairable from total loss cases.

Compiler Lexical and Syntax Analyzer Phase Design for String, Char & Integer Operations

  • Deployment of lexical and syntax analyzer phase of the compiler for a devised set of grammar rules and conditions for variable declaration that reports error in case of syntax misalignment using C, Flex and Bison technologies

05

AWARDS &
SEMINARS

<

AIR(ALL INDIA RANK) - 40

  • Secured an AIR(ALL INDIA RANK) -40 at National Creativity Aptitude Test conducted across 198 institutes in India - 2020.

WINNER - QUIZOPHILE

  • Winner of Annual Inter-Institute Quiz Competition organized by Computer Society of India (CSI) at Dharmsinh Desai University

RESEARCH SEMINAR

  • Participated 5-day workshop at Innovation in Science Pursuits for Inspired Research (INSPIRE) held by Department of Science and Technology, Government of India

Developer Student Clubs Dharmsinh Desai University powered by Google · Apprenticeship

  • Used to conduct various informative sessions on new technologies and features of Google Assistant and Google Cloud Platform
  • Tutored the new members on making an application on Google Assistant and winning exciting prizes like Google Home, badges, bags,etc.
  • Developed 3 Apps on Google Assistant which are running successfully.

06

Contact Me

📞 Mobile Number

+1 812 318 2011

📧 Mail

preetjmodi@gmail.com

📧School Mail

prmodi@iu.edu

Social Network