2
Publications
0
Followers
0
Following
3
Questions
About
Shreyas Mahimkar is a skilled Data Scientist at LiveRamp, with extensive experience in the consumer electronics industry. He excels in Python, Java, Apache Spark, and database management, holding a Master's Degree in Computer Science from Northeastern University. Shreyas has previously worked at Data Plus Math and TiVo, where he developed data pipelines, predictive models, and clustering methods to enhance TV viewership insights. His projects include predicting crime locations using big data and developing a web crawler. He has also interned at Novartis, contributing to big data analysis with Spark. Shreyas's background combines strong technical skills with practical experience in data science.
Shreyas Mahimkar is a seasoned Senior Data Scientist and Engineer at LiveRamp with over five years of experience in leveraging data to drive business solutions and innovation. His expertise lies in transforming complex data into actionable insights, leading multiple Proof-of-Concept projects, and building innovative data products that enhance campaign reporting and marketing efficiency. Proficient in Python, Java, SQL, and Apache Spark, Shreyas has contributed significantly to revenue growth through technical leadership and collaboration on cutting-edge data science projects.
At LiveRamp, Shreyas has been instrumental in designing and deploying advanced data science solutions. His contributions include developing a Market Mix Modeling (MMM) framework that utilizes Bayesian combiners to integrate national and regional ad performance data, improving marketing spend attribution accuracy by 40%. This innovative approach helped secure a $1 million revenue deal by demonstrating incremental ROI to enterprise clients. He also operationalized the first Reach Frequency Model using SparkSQL for large-scale digital activation, leading to a multi-million-dollar client contract. Shreyas’s efforts in building a linear attribution system integrating causal impact analysis and multi-touch attribution modeling led to rapid adoption by 10+ enterprise clients in the first quarter of deployment.
Apart from product development, Shreyas has excelled in creating standardized experimental analysis pipelines, winning company hackathons for innovative batch campaign reporting systems based on Intent-to-Treat (ITT) and Per Protocol (PP) causal frameworks. His optimizations in SparkSQL reduced client implementation processing time by 60%, showcasing his ability to enhance efficiency across data operations.
Before joining LiveRamp, Shreyas served as a Data Scientist and Engineer at Data Plus Math, where he built and implemented multiple data pipelines for various clients. He improved impression, reach, and attribution metrics, effectively wearing multiple hats in data science, engineering, DevOps, and product management. His contributions included automating system maintenance and data workflows with Bash shell scripts, reducing manual errors by 40%.
Prior to this, Shreyas worked at TiVo for over two years as a Data Scientist and Engineer. He developed predictive models using Random Forest algorithms to forecast user-level TV viewership probabilities, improving model accuracy by 30%. He also designed predictive analytics solutions with linear regression to forecast TV ratings, helping shape programming and advertising strategies. His work in data modeling and processing large-scale data, including building hybrid advertising models and engineering Scala Spark ETL pipelines, optimized query performance and improved overall system efficiency.
Shreyas’s academic foundation includes a Master’s degree in Computer Science from Northeastern University, where he specialized in machine learning, algorithms, and parallel data processing. He worked on projects like "Fighting Crime with Big Data," developing predictive tools using Hadoop and AWS. During his internship at Novartis, Shreyas built a Principal Component Analysis (PCA) tool on Spark, enabling analysts to work with large datasets effectively.
Shreyas has earned several certifications in machine learning and deep learning from Coursera, reflecting his commitment to continuous learning. With a strong foundation in data science, engineering, and product development, Shreyas is a dynamic professional who continues to deliver impactful solutions at the intersection of technology and business.
Skills & Expertise
SQL
JavaScript
Python
Database Management
Hadoop
Apache Spark
Data Visualization
Principal Component Analysis (PCA)
Linear Regression
Data Plus Math
AWS (Amazon Web Services)
Bash Shell Scripting
Random Forest Algorithms
DevOps Practices
Data Pipeline Automation
Machine Learning Frameworks
Reporting Tools
Parallel Data Processing
Large-Scale Data Workflow Management
Cloud-Native Data Engineering
Research Interests
Data Analysis
Technical Leadership
Forecasting
Machine Learning Tools
Data Scientist
Predictive Analytics
Market Mix Modeling (MMM)
Bayesian Data Integration
Causal Impact Analysis
Multi-Touch Attribution Modeling
Digital Activation Models
Revenue Optimization
Business Solutions
Data Product Development
Large-Scale Data Processing
Experimental Analysis Pipelines
Campaign Reporting Systems
Standardized Data Frameworks
Hybrid Advertising Models
Attribution Metrics Optimization
Statistical Analysis and Modeling
Data Science Projects
Data-Driven Marketing Strategies
Cloud-Based Data Solutions
Data Science Hackathon Innovations
Data Modelling
Connect With Me
Experience
Senior Data Scientist / Engineer
- 1. Developed market mix modeling (MMM) framework using Bayesian combiners to synthesize national lift metrics with geo-specific ad performance data (e.g., Boston, Chicago), improving marketing spend attribution accuracy by 40%. This methodology directly quantified incremental ROI across channels, enabling a $1M revenue deal by demonstrating measurable campaign impact to enterprise clients 2. Operationalized first Reach Frequency Model using SparkSQL for US-scale projections, enabling digital clean room activation and driving $1M+ client contract 3. Designed and deployed a linear attribution system integrating causal impact analysis with multi-touch attribution modeling, achieving adoption by 10+ enterprise clients within the first quarter of product deployment 4. Won company hackathons for batch campaign reporting system using A/B testing with Intent-to-Treat (ITT) and Per Protocol (PP) causal frameworks, later productized to onboard 6+ enterprise clients through standardized experiment analysis pipelines 5. Established best practices for experiment QA through SparkSQL optimizations that reduced processing time by 60% across all client implementations1
Data Scientist / Enginner
- 1. Built and implemented may data pipelines for multiple clients, resulting in significant improvements in impression, reach, lift and attribution metrics. In this fast-paced startup environment, I leveraged my skills in data engineering, data science, product management, DevOps, and UI to wear multiple hats and ensure project success. 2. Developed Bash shell scripts to automate system maintenance, data processing workflows, and batch job scheduling using tools like cron and tmux reducing manual errors by 40%
Data Scientist / Enginner
- 1. Leveraged Random Forest algorithms to predict user-level program/channel viewership probabilities, achieving 30% accuracy improvement over previous models 2. Designed predictive analytics solutions using Linear Regression to forecast aggregated TV ratings, informing programming and advertising strategies 3. Conducted statistical correlation analysis on 10M+ viewer records to identify 15+ key behavioral patterns and demographic drivers of TV engagement 4. Established data-driven insights framework guiding $2M+ annual content acquisition decisions through feature importance analysis 5. Built hybrid TV advertising data models across SQL (MySQL), NoSQL (Presto), and cloud storage (S3), reducing query latency by 40% through indexing optimization 6. Architected star schema for ad performance tracking, enabling cross-platform campaign analysis across 50+ metrics 7. Engineered Scala Spark ETL pipelines processing 1.2TB daily of Nielsen ratings data, achieving 99.98% uptime SLA 8. Implemented S3-based data lake with Athena SQL interface supporting 150+ concurrent analysts through partitioned Parquet storage Data Modelling: Created TV Ad data model in NOSQL and SQL databases in Presto, MYSQL and S3. Created TV ratings data model in Scala Spark and storing it into Amazon S3 and accessing it via Amazon Athena. Data Analysis: Extracted actionable insights on TV viewership data (Big data) using zeppelin and matplotlib
Masters Assistant
- Candidate for Masters of Science in Computer Science May 2016 Related Courses: Parallel Data Processing in Map-Reduce, Machine Learning, Algorithms Projects: • Fighting Crime with Big Data – Java, Amazon Web Services – Elastic Map Reduce, S3, Weka: Developed a tool that predicts the crime location in Boston and Chicago streets by running multiple Map-Reduce programs on large dataset of crime using K-D tree Algorithm.
Data Science Intern
Senior Software Engineer
- Worked for a Financial Corporation – Auto Loans for a client in US 1. Designed and optimized PL/SQL packages, procedures, and functions to streamline data retrieval processes, improving query performance by 35%. 2. Developed ETL workflows using PL/SQL for data transformation and migration between Oracle databases, ensuring compliance with business logic for report generation. 3. Resolved critical defects in an Auto-Loan System by debugging PL/SQL scripts and Java code, reducing production incidents by 50%. 4. Automated data movement tasks using Python scripts, integrating with Oracle databases to reduce manual processing time by 25%. 5. Collaborated with cross-functional Agile teams to deliver new system features, including Unix shell scripts for job scheduling and batch processing. 6. Conducted database performance tuning by analyzing execution plans, indexing strategies, and materialized views, cutting latency by 20%. Worked for a Financial Corporation – Auto Loans for a client in US 1. Designed and optimized PL/SQL packages, procedures, and functions to streamline data retrieval processes, improving query performance by 35%. 2. Developed ETL workflows using PL/SQL for data transformation and migration between Oracle databases, ensuring compliance with business logic for report generation. 3. Resolved critical defects in an Auto-Loan System by debugging PL/SQL scripts and Java code, reducing production incidents by 50%. 4. Automated data movement tasks using Python scripts, integrating with Oracle databases to reduce manual processing time by 25%. 5. Collaborated with cross-functional Agile teams to deliver new system features, including Unix shell scripts for job scheduling and batch processing. 6. Conducted database performance tuning by analyzing execution plans, indexing strategies, and materialized views, cutting latency by 20%. Skills: PL/SQL · oracl · Java
Education
Northeastern University (NEU/NU), Boston
Watumull Institute of Engineering And Technology (WIET)
Projects
How Mobility Informs Epidemic Dynamics Districts Sierra Leone
Question: How does mobility inform epidemic dynamics on the district level in Sierra Leone
Data Used/Wrangled/Processed: Sierra Leone Roads; Sub-national infection dataset, Sierra Leone; general demography Sierra Leone (CIA World Factbook); NEJM article for certain parameters
Analysis Approach: SEIR Model with Metapopulation Model Approach (Multi-compartments / SEIR for each district and flows between); Preliminary Training & Testing; Graphs/Visuals; GIS;
Findings/Limitations: Estimated ~Mobility Weights between Districts; Tested November Predictions Compared to November Observations Cumulative Case Numbers with preliminary measurement of error (need to extend to p-values? need to measure cross-validation by district &/or time); Predicted December
Possible Applications: 1) Predict Epidemic Dynamics In Future Over Time & Space (By District); 2) Use Trained, Validated, Tested Model + all current data mobility + case data to predict optimal locations/expansions of mobility checkpoints/restrictions (locations to test/treat/isolate/restrict-movement/etc.) in order to stem epidemic
Certificates & Licenses (1)
Improving Deep Neural Networks: Hyperparameter tuning, Regularization and Optimization
Improving Deep Neural Networks: Hyperparameter Tuning, Regularization and Optimization
Awards & Achievements (1)
🏆 ROBO SOCCER EVENT in GENESIS 08
Description
Peer-Reviewed Articles (31)
The rise of AI-powered automation is reshaping job markets worldwide, presenting both opportunities and challenges. AI has the capability to streamline repetitive tasks, improving operational efficiency and reducing the necessity...
Artificial Intelligence (AI) is increasingly becoming a transformative force in the healthcare sector, driving innovations in diagnostics, treatment personalization, and operational efficiencies. This paper examines the significant impact of AI...
The goal of image classification, a critical task in computer vision, is to group images into specified classes according to their visual attributes. To tackle this problem, a variety of...
To effectively manage tomato crops, illnesses must be detected early and accurately, as they can have a substantial impact on output and quality. This study investigates the use of deep...
Facial recognition using AI is a rapidly evolving technology that uses machine learning algorithms and neural networks to identify and authenticate people based on their facial features. This technology has...
Role in Research Journals (9)
Editorial Board Member
IJNRD
Assistant Editor
IJNRD
Advisory Member
IJNRD
Editorial Board Member
IJNRD
Guest Editor
IJNRD
Publications (2)
When it comes to financial markets, information asymmetry, which occurs when different players have access to
differing degrees of knowledge, may result in inefficient markets, skewed pricing, and un...
In the competitive landscape of mobile applications, user engagement is a critical factor for success. Deep linking emerges as a powerful technique to enhance user experience by facilitating seamless...
dd