Abhishek Das

Software Engineering Lead at Salesforce, Inc.
📚 Software Engineering Lead | Bellevue, Washington, United States
Mutual Connections
Loading...
S9-102024-0406205
0 Publications
0 Followers
0 Following
2 Questions

👤 About

Skills & Expertise

cloud computing SPARK SQL HIVE Postgres SQL Teradata SQL JavaScript node.js Artificial Intelligence Distributed Systems Java Hadoop Load Balancing Auto Scaling Emerging technologies Redis Edge Computing Junit Data Protection Data Migration cloud architecture Real-Time Inference Network Control Baremetal Services Service Fabric Control Plane Computational Marketing Display Ads Fault Tolerance Neo4j Database Innovation & Mentorship test NG

Research Interests

cloud computing Machine Learning Software Engineering Team Building Computing Mentorship Skills Network Control Plane Data Migration Auto-Scaling Solutions Service Fabric Orchestration Display Ads Engineering Fault Tolerance Graph Databases (Neo4j) Intelligent Systems Hadoop Distributed Systems

Connect With Me

💼 Experience

Software Engineering Lead

Salesforce, Inc. · September 2021 - Present
  • • Led design and development of a sophisticated multi-step multi-tenant Directed Acyclic Graph (DAG) execution service which enabled stitching of multiple steps required for successful inferences at extremely high scale with millisecond latencies. • Led Successful onboarding of several predictive and generative use cases with 100+ custom plugins written by app developers across multiple organizations. • Engineered on-demand scoring ML platform with a high-scale, low-latency multi-step DAG execution system for B2C applications. • Led the cross organizational effort of design and development of an innovative platform tailored for hosting Large Language Models (LLMs), specifically addressing CodeGen and Flow GPTs (Patent Pending). Ensured seamless handling of inference requests with an emphasis on achieving low latency and high scalability while leveraging some of current platform services. • Designed an innovative GDPR compliance violation resolution service capable of scanning petabytes of data. (Java, AWS EMR, Kubernetes, GRPC, REST, PyTorch, Python, Sagemaker, Inference Frameworks)

Senior Software Engineer Tech Lead

Microsoft · January 2017 - September 2021
  • 1. Azure Dedicated (10/20 - 9/21) - Tech lead (Network control plane) for building Baremetal as a service for dedicated workloads onboarding Nutanix as the 1st workload. - Made several improvements in the stack and processes around the team. - Mentored new engineers in the team. 2. Azure Stream Analytics (1/17 - 9/20) - Part of the core leads team to deliver Azure Stream Analytics dedicated Clusters (https://docs.microsoft.com/en-us/azure/stream-analytics/cluster-overview) - Lead control plane effort for Stream Analytics on edge devices (https://docs.microsoft.com/en-us/azure/stream-analytics/stream-analytics-edge) - Billing lead for Stream Analytics offerings. - Owned and designed the orchestrator on control plane responsible for managing the Stream Analytics jobs. - Delivered several new features like parquet input, auto scale ( https://docs.microsoft.com/en-us/azure/stream-analytics/stream-analytics-autoscale) customer provided storage ( https://docs.microsoft.com/en-us/azure/stream-analytics/data-protection) etc. - Enabled data plane migration from HDI to service fabric by moving some of migration resiliency and orchestration to control plane. - Mentored Junior engineers

Software Development Engineer

Groupon · April 2016 - January 2017
  • • Developed tools and features with REST APIs for internal teams, utilizing machine learning algorithms for display ads bidding prices. • Led migration of the core bidding engine from Teradata to HDFS, optimizing performance. • Implemented a Real-Time Attribution data pipeline with a supporting dashboard for pricing updates. • Collaborated with marketing and product teams for new features and requirements. (Java, JavaScript, node, SQL, Postgres, Redis, Teradata, test NG, JUnit, Hadoop, Hive, Spark)

Software Engineer

Dell EMC · July 2012 - March 2016
  • • Designed and implemented a high-performance Hyper-V Multi-Proxy backup architecture for Avamar and Networker's next release. • Enhanced backup efficiency over CSV and SMB storage, achieving significant performance improvements. • Secured a patent (9639428) for optimizing backup processes in clusters with multiple proxy servers. • Balanced backup loads efficiently among proxies, enabling parallel backups and boosting overall performance. • Provided engineering support and hotfixes for high-priority customer escalations, ensuring rapid issue resolution. (C++, C#, Win APIs, PowerShell, Socket Programming, IPC, XML)

Co-op, R&D IT

GSK · June 2011 - December 2011

Web Developer(Student)

Texas A&M University · August 2010 - May 2011

🎓 Education

Texas A&M University

Master of Science (MS) in Computer Science · 2012

Odisha University of Technology and Research (OUTR)

B.Tech in Computer Science & Engineering · 2009

🚀 Projects

Improved HDFS Block Allocation Scheme
Agency Name: || Jan 2012 - Jan 2012
Modified the Block Allocation Scheme to keep related files together based on a tag associated with the file.
Dynamic Replication of Blocks in Hadoop
Agency Name: || Jan 2012 - Jan 2012
Hadoop HDFS makes 3 copies of each chunk by default. The project was to modify existing hadoop codebase to introduce dynamic replication for frequently requested blocks to improve response time.
Hadoop Block Placement
Agency Name: Texas A&M University || Jan 2012 - Apr 2012
Designed an improved placement algorithm for HDFS by placing related files in the same subset of nodes.
Trivia Pitcher
Agency Name: Texas A&M University || Mar 2011 - Present
An online multi-user trivia system which traverses a neo4j graph database created from Wikipedia xml dump to create trivia questions and provides questions based on the user’s difficulty level which is determined by several users and page attributes. (PHP, neo4j API)
Distributed Password Cracker
Agency Name: Texas A&M University || Mar 2012 - Present
It creates a distributed environment with a single meta data server and several workers. We tried to implement few aspects of a distributed system such as load balancing, worker failure scenarios etc.

🏅 Certificates & Licenses (3)

Blockchain Basics
Event: Blockchain Basics · Lynda.com · Issued on September 2017
Blockchain technology demands attention. And it's not just due to the way it revolutionized monetary transactions. The concept of transactional trust through code has disrupted a variety of industries, offering ways to streamline everything from financial services to identity management. In this introductory-level course, learn what blockchain is and what it might mean to you. Instructor Jonathan Reichental dives into the topic from a conceptual perspective, ensuring that a nontechnical audience can glean value from each lesson. Jonathan describes some of the current challenges with the internet, including existing risks and security problems. He goes over how traditional online databases function, so that you have a basis for how the blockchain redesigns this function. Plus, learn how the blockchain has become a potential solution for many of the existing limitations of online databases, which challenges must be solved in order to achieve broader adoption, and more. https://www.linkedin.com/learning/blockchain-basics-14414119?trk=lynda_redirect_learning
Cloud Architecture: Advanced Concepts
Event: Cloud Architecture: Advanced Concepts · Lynda.com · Issued on October 2017
If you’re an IT professional, you already know that new architectural patterns and concepts are changing the way we design for the cloud. This is especially true in today’s remote and hybrid work environment, where choosing the right type of cloud platform can determine the success and functionality of your entire IT enterprise. Join instructor Lee Atchison as he shows you the business and tech-related basics of orchestrating and managing an effective cloud architecture. Learn about the various types of cloud platforms and architecture types, as well as how to determine which option will work best for you. Explore service and microservice architectures, data management tools, serverless computing, and cloud security as you build out your infrastructure. Lee shows you how to successfully incorporate automation and management practices into your architecture strategy so you can create a more efficient environment for software development. https://www.linkedin.com/learning/cloud-architecture-advanced-concepts-14595141
Generative AI with Large Language Models
Event: Generative AI with Large Language Models · Amazon Web Services · Issued on January 2006
https://www.coursera.org/account/accomplishments/verify/D2YYYONEF8PU

🏆 Awards & Achievements (1)

🏆 EMC Excellence Award
Awarded by: EMC corporation || Year: 2013
Description

💡 Patents (1)

Optimized backup of clusters with multiple proxy servers
Patent No. 9639428 · Filed March 2014 · Issued on May 2017
Inventors: Koteswara R. Boda (Redmond, WA), Abhishek Das (Seattle, WA), Matthew D. Buchman (Seattle, WA) || Category: Computer Sciences
description
dd