Shreyas Mahimkar

Senior Data Scientist / Engineer at LiveRamp
📚 Data Scientist at Liveramp | Boston, Massachusetts, United States
Mutual Connections
Loading...
2 Publications
0 Followers
0 Following
3 Questions

👤 About

Skills & Expertise

SQL JavaScript Python Database Management Hadoop Apache Spark Data Visualization Principal Component Analysis (PCA) Linear Regression Data Plus Math AWS (Amazon Web Services) Bash Shell Scripting Random Forest Algorithms DevOps Practices Data Pipeline Automation Machine Learning Frameworks Reporting Tools Parallel Data Processing Large-Scale Data Workflow Management Cloud-Native Data Engineering

Research Interests

Data Analysis Technical Leadership Forecasting Machine Learning Tools Data Scientist Predictive Analytics Market Mix Modeling (MMM) Bayesian Data Integration Causal Impact Analysis Multi-Touch Attribution Modeling Digital Activation Models Revenue Optimization Business Solutions Data Product Development Large-Scale Data Processing Experimental Analysis Pipelines Campaign Reporting Systems Standardized Data Frameworks Hybrid Advertising Models Attribution Metrics Optimization Statistical Analysis and Modeling Data Science Projects Data-Driven Marketing Strategies Cloud-Based Data Solutions Data Science Hackathon Innovations Data Modelling

Connect With Me

💼 Experience

Senior Data Scientist / Engineer

LiveRamp · July 2019 - Present
  • 1. Developed market mix modeling (MMM) framework using Bayesian combiners to synthesize national lift metrics with geo-specific ad performance data (e.g., Boston, Chicago), improving marketing spend attribution accuracy by 40%. This methodology directly quantified incremental ROI across channels, enabling a $1M revenue deal by demonstrating measurable campaign impact to enterprise clients 2. Operationalized first Reach Frequency Model using SparkSQL for US-scale projections, enabling digital clean room activation and driving $1M+ client contract 3. Designed and deployed a linear attribution system integrating causal impact analysis with multi-touch attribution modeling, achieving adoption by 10+ enterprise clients within the first quarter of product deployment 4. Won company hackathons for batch campaign reporting system using A/B testing with Intent-to-Treat (ITT) and Per Protocol (PP) causal frameworks, later productized to onboard 6+ enterprise clients through standardized experiment analysis pipelines 5. Established best practices for experiment QA through SparkSQL optimizations that reduced processing time by 60% across all client implementations1

Data Scientist / Enginner

Data Plus Math, Boston · October 2018 - June 2019
  • 1. Built and implemented may data pipelines for multiple clients, resulting in significant improvements in impression, reach, lift and attribution metrics. In this fast-paced startup environment, I leveraged my skills in data engineering, data science, product management, DevOps, and UI to wear multiple hats and ensure project success. 2. Developed Bash shell scripts to automate system maintenance, data processing workflows, and batch job scheduling using tools like cron and tmux reducing manual errors by 40%

Data Scientist / Enginner

TiVo, Boston · July 2016 - October 2018
  • 1. Leveraged Random Forest algorithms to predict user-level program/channel viewership probabilities, achieving 30% accuracy improvement over previous models 2. Designed predictive analytics solutions using Linear Regression to forecast aggregated TV ratings, informing programming and advertising strategies 3. Conducted statistical correlation analysis on 10M+ viewer records to identify 15+ key behavioral patterns and demographic drivers of TV engagement 4. Established data-driven insights framework guiding $2M+ annual content acquisition decisions through feature importance analysis 5. Built hybrid TV advertising data models across SQL (MySQL), NoSQL (Presto), and cloud storage (S3), reducing query latency by 40% through indexing optimization 6. Architected star schema for ad performance tracking, enabling cross-platform campaign analysis across 50+ metrics 7. Engineered Scala Spark ETL pipelines processing 1.2TB daily of Nielsen ratings data, achieving 99.98% uptime SLA 8. Implemented S3-based data lake with Athena SQL interface supporting 150+ concurrent analysts through partitioned Parquet storage Data Modelling: Created TV Ad data model in NOSQL and SQL databases in Presto, MYSQL and S3. Created TV ratings data model in Scala Spark and storing it into Amazon S3 and accessing it via Amazon Athena. Data Analysis: Extracted actionable insights on TV viewership data (Big data) using zeppelin and matplotlib

Masters Assistant

Northeastern University (NEU/NU), Boston · January 2016 - May 2016
  • Candidate for Masters of Science in Computer Science May 2016 Related Courses: Parallel Data Processing in Map-Reduce, Machine Learning, Algorithms Projects: • Fighting Crime with Big Data – Java, Amazon Web Services – Elastic Map Reduce, S3, Weka: Developed a tool that predicts the crime location in Boston and Chicago streets by running multiple Map-Reduce programs on large dataset of crime using K-D tree Algorithm.

Data Science Intern

Novartis, Cambridge · July 2014 - December 2015

Senior Software Engineer

Infosys · August 2012 - July 2013
  • Worked for a Financial Corporation – Auto Loans for a client in US 1. Designed and optimized PL/SQL packages, procedures, and functions to streamline data retrieval processes, improving query performance by 35%. 2. Developed ETL workflows using PL/SQL for data transformation and migration between Oracle databases, ensuring compliance with business logic for report generation. 3. Resolved critical defects in an Auto-Loan System by debugging PL/SQL scripts and Java code, reducing production incidents by 50%. 4. Automated data movement tasks using Python scripts, integrating with Oracle databases to reduce manual processing time by 25%. 5. Collaborated with cross-functional Agile teams to deliver new system features, including Unix shell scripts for job scheduling and batch processing. 6. Conducted database performance tuning by analyzing execution plans, indexing strategies, and materialized views, cutting latency by 20%. Worked for a Financial Corporation – Auto Loans for a client in US 1. Designed and optimized PL/SQL packages, procedures, and functions to streamline data retrieval processes, improving query performance by 35%. 2. Developed ETL workflows using PL/SQL for data transformation and migration between Oracle databases, ensuring compliance with business logic for report generation. 3. Resolved critical defects in an Auto-Loan System by debugging PL/SQL scripts and Java code, reducing production incidents by 50%. 4. Automated data movement tasks using Python scripts, integrating with Oracle databases to reduce manual processing time by 25%. 5. Collaborated with cross-functional Agile teams to deliver new system features, including Unix shell scripts for job scheduling and batch processing. 6. Conducted database performance tuning by analyzing execution plans, indexing strategies, and materialized views, cutting latency by 20%. Skills: PL/SQL · oracl · Java

🎓 Education

Northeastern University (NEU/NU), Boston

M.SC in Computer Science · 2016

Watumull Institute of Engineering And Technology (WIET)

BE in Computer Engineering · 2010

🚀 Projects

How Mobility Informs Epidemic Dynamics Districts Sierra Leone
Agency Name: HackEbola || Nov 2014 - Present
Question: How does mobility inform epidemic dynamics on the district level in Sierra Leone Data Used/Wrangled/Processed: Sierra Leone Roads; Sub-national infection dataset, Sierra Leone; general demography Sierra Leone (CIA World Factbook); NEJM article for certain parameters Analysis Approach: SEIR Model with Metapopulation Model Approach (Multi-compartments / SEIR for each district and flows between); Preliminary Training & Testing; Graphs/Visuals; GIS; Findings/Limitations: Estimated ~Mobility Weights between Districts; Tested November Predictions Compared to November Observations Cumulative Case Numbers with preliminary measurement of error (need to extend to p-values? need to measure cross-validation by district &/or time); Predicted December Possible Applications: 1) Predict Epidemic Dynamics In Future Over Time & Space (By District); 2) Use Trained, Validated, Tested Model + all current data mobility + case data to predict optimal locations/expansions of mobility checkpoints/restrictions (locations to test/treat/isolate/restrict-movement/etc.) in order to stem epidemic

🏅 Certificates & Licenses (1)

Improving Deep Neural Networks: Hyperparameter tuning, Regularization and Optimization
Event: DeepLearning.AI · Coursera · Issued on April 2018
Improving Deep Neural Networks: Hyperparameter Tuning, Regularization and Optimization

🏆 Awards & Achievements (1)

🏆 ROBO SOCCER EVENT in GENESIS 08
Awarded by: Watumull || Year: 2008
Description

📄 Peer-Reviewed Articles (31)

The rise of AI-powered automation is reshaping job markets worldwide, presenting both opportunities and challenges. AI has the capability to streamline repetitive tasks, improving operational efficiency and reducing the necessity...
Artificial Intelligence (AI) is increasingly becoming a transformative force in the healthcare sector, driving innovations in diagnostics, treatment personalization, and operational efficiencies. This paper examines the significant impact of AI...
The goal of image classification, a critical task in computer vision, is to group images into specified classes according to their visual attributes. To tackle this problem, a variety of...
To effectively manage tomato crops, illnesses must be detected early and accurately, as they can have a substantial impact on output and quality. This study investigates the use of deep...
Facial recognition using AI is a rapidly evolving technology that uses machine learning algorithms and neural networks to identify and authenticate people based on their facial features. This technology has...

📖 Role in Research Journals (9)

Editorial Board Member
Journal: INTERNATIONAL JOURNAL OF NOVEL RESEARCH AND DEVELOPMENT || Publisher Name: IJ Publication
IJNRD
Assistant Editor
Journal: INTERNATIONAL JOURNAL OF NOVEL RESEARCH AND DEVELOPMENT || Publisher Name: IJ Publication
IJNRD
Advisory Member
Journal: INTERNATIONAL JOURNAL OF NOVEL RESEARCH AND DEVELOPMENT || Publisher Name: IJ Publication
IJNRD
Editorial Board Member
Journal: INTERNATIONAL JOURNAL OF NOVEL RESEARCH AND DEVELOPMENT || Publisher Name: IJ Publication
IJNRD
Guest Editor
Journal: INTERNATIONAL JOURNAL OF NOVEL RESEARCH AND DEVELOPMENT || Publisher Name: IJ Publication
IJNRD

📚 Publications (2)

Journal: International Journal of Progressive Research in Engineering Management and Science • November 2021
When it comes to financial markets, information asymmetry, which occurs when different players have access to differing degrees of knowledge, may result in inefficient markets, skewed pricing, and un...
Analyzing Information Asymmetry Financial Markets Machine Learning Information Imbalance Detection Supervised Learning Unsupervised Learning Decision Trees Support Vector Machines Neural Networks Ensemble Methods Feature Engineering Data Preparation Insider Trading Detection Sentiment Analysis Anomaly Detection Stock Price Prediction Market Transparency Macro-Economic Indicators Trade Volume Analysis Price Change Prediction
Journal: International Research Journal of Modernization in Engineering Technology and Science • November 2021
In the competitive landscape of mobile applications, user engagement is a critical factor for success. Deep linking emerges as a powerful technique to enhance user experience by facilitating seamless...
Deep Linking User Engagement Mobile Applications User Retention User Experience Deferred Deep Links Contextual Deep Links Traditional Deep Links User Acquisition Marketing Strategies Feature Navigation Empirical Evidence Best Practices App Loyalty Mobile Growth Intuitive User Interface Seamless Navigation User-Centric Design App Metrics Optimization Mobile App Development
dd