• Keen Learner • Software Engineer
3 Years of Experience
Pune, Maharashtra, India
-
-
Not Available
Experienced Big Data Engineer proficient in Apache Spark, Hadoop, Scala, Java, Python, C, and C++, with a keen interest in Machine Learning. Expertise in designing and optimizing large-scale data processing systems, coupled with a passion for harnessing data-driven insights. Committed to pushing the boundaries of technology to drive innovation and enhance business outcomes.
PubMatic, SaaS/Cloud Product, Computer Software
HighRadius
Omdena
PubMatic, HighRadius, Omdena, TheSmartBridge, HighRadius
Job Title : Software Engineer
Company name : PubMatic
Period : August 2021 - Present
Summary : PubMatic is a digital advertising technology company who empowers premium app developers and publishers to maximize their programmatic advertising business.
• Successfully led the migration of a Kinetica-backed Real Time pipeline to an In-house Spark-Structured Streaming solution, ensuring seamless data flow and reducing dependency on external platforms.
• Optimized the performance of Spark Structured Streaming Consumers, resulting in a remarkable 50% reduction in batch processing time
• Spearheaded initiatives to enhance query processing efficiency for Metabase dashboards, achieving an impressive 60% reduction in query processing time through the implementation of materialized views
Technologies: Apache Spark, Hadoop, Snowflake, Mysql, Scala, Java8, SpringBoot, Python, Apache Nifi,
Snowflake, Docker, Shell scripting, Looker, Airflow, Oozie.
Location : Pune, Maharashtra, India
Job Title : Data Science Intern
Company name : HighRadius
Period : January 2021 - June 2021
Summary : HighRadius is a Fintech enterprise Software-as-a-Service (SaaS) company which leverages Artificial Intelligence-based Autonomous Systems to help companies automate Accounts Receivable and Treasury processes
About Internship:
• Worked on Machine Learning Pipelines to predict remittances, including monitoring and analysis, testing and deploying them in production for clients.
• Analyzed and reported the cases where model failed to perform.
Job Title : ML Engineer
Company name : Omdena
Period : November 2020 - January 2021
Summary : Omdena is the collaborative platform to build innovative, ethical, and efficient AI and Data Science solutions to real-world problems
About Project:
Collaborating with 30+ intellectuals around the globe on Analyzing the Role of Connectivity on Economic & Human Development hosted by UNDP.
• Find available data online for training purposes.
• Extract, transform and load data
• Research, experiment with, and implement suitable ML algorithms and tools
• Use data modeling and evaluation strategy to find patterns and predict unseen instances
• Apply machine learning algorithms and libraries
• Analyze large, complex datasets to extract insights and decide on the appropriate technique
Location : Remote
Job Title : Call For Code 2020
Company name : TheSmartBridge
Period : September 2020 - October 2020
Summary : • Participated in Call For Code 2020 to collaborate on a project under the mentorship of
Hemant Kumar Gahlot and colleges: Snigdha, Sourick, and Gaurav.
• Built and deployed a wind energy prediction application with diminishing MSE score to 0.112
Job Title : Summer Intern
Company name : HighRadius
Period : April 2020 - June 2020
Summary : HighRadius is a Fintech enterprise Software-as-a-Service (SaaS) company which leverages Artificial Intelligence-based Autonomous Systems to help companies automate Accounts Receivable and Treasury processes
About Internship:
• Managed complex projects from start to finish
• Built a full-stack AI-enabled FinTech B2B Invoice Management Application
Title : The Rust Programming Language
Period : September 2023 - Present
Issuing Authority : Udemy
Title : Google Looker Masterclass: Looker & LookML A-Z 2023
Period : April 2023 - Present
Summary : google.com, https://drive.google.com/file/d/1GB-zZ1dYsyZylDWQn5967r1woOyWfFl6/view?usp=sharing
Issuing Authority : Udemy
Title : Apache Spark with Scala - Hands On with Big Data!
Period : September 2021 - Present
Summary : google.com, https://drive.google.com/file/d/103QBBqOV7Kfh3cTNGmwUgmvscuXQiyRK/view?usp=sharing
Issuing Authority : Udemy
Title : From 0 to 1: The Oozie Orchestration Framework
Period : September 2021 - Present
Summary : google.com, https://drive.google.com/file/d/1A9XbaDfPRgXv3mPnAavM597vhumNn7pS/view?usp=sharing
Issuing Authority : Udemy
Title : Maven Quick Start: A Fast Introduction to Maven by Example
Period : September 2021 - Present
Summary : google.com, https://drive.google.com/file/d/1pUriwYimlfdvOz5PltHKkYFie_XQaZF_/view?usp=sharing
Issuing Authority : Udemy
Title : Design Thinking and Predictive Analytics for Data Products
Period : February 2021 - Present
Summary : NUSMKRKATWLG, coursera.org, https://www.coursera.org/account/accomplishments/certificate/NUSMKRKATWLG
Issuing Authority : Coursera Course Certificates
Title : Meaningful Predictive Modeling
Period : February 2021 - Present
Summary : Z3YTX8YNE4G8, coursera.org, https://www.coursera.org/account/accomplishments/certificate/Z3YTX8YNE4G8
Issuing Authority : Coursera Course Certificates
Title : Basic Data Processing and Visualization
Period : January 2021 - Present
Summary : 2P7MLF9DGXQX, google.com, https://drive.google.com/file/d/1ILWZKoucbILv_crSqUx9oi9Ub9WWyB0u/view?usp=sharing
Issuing Authority : Coursera Course Certificates
Title : Image Processing Using Keras With Python
Period : January 2021 - Present
Summary : google.com, https://drive.google.com/file/d/1k5o98BCpT2JTN2s6usdpfEZROVrOWinN/view?usp=sharing
Issuing Authority : DataCamp
Title : Basics of Geocomputation and Geoweb Services
Period : December 2020 - Present
Summary : 3d788949ce90d3ea14a5bcdf0efffa84, google.com, https://drive.google.com/file/d/1mL0-JesJYNaLHcnyTXf_DHw5b4lu91js/view?usp=sharing
Issuing Authority : Indian Institute of Remote Sensing (IIRS), Indian Space Research Organization (ISRO)
Title : Natural Language Processing Specialization
Period : October 2020 - Present
Summary : KG7RLTECXFF7, coursera.org, https://www.coursera.org/account/accomplishments/specialization/certificate/KG7RLTECXFF7
Issuing Authority : deeplearning.ai
Title : Deep Learning Specialization
Period : September 2020 - Present
Summary : 4Z2BYBG36BKX, coursera.org, https://www.coursera.org/account/accomplishments/specialization/certificate/4Z2BYBG36BKX
Issuing Authority : deeplearning.ai
Title : Git + GitHub for Open Source Collaboration
Period : September 2020 - Present
Summary : G4CDP2C55F6V, coursera.org, https://www.coursera.org/account/accomplishments/certificate/G4CDP2C55F6V
Issuing Authority : Coursera
Title : Linux for Developers
Period : September 2020 - Present
Summary : GM2PQHGRA7DA, coursera.org, https://www.coursera.org/account/accomplishments/certificate/GM2PQHGRA7DA
Issuing Authority : The Linux Foundation
Title : ETL and Data Pipelines with Shell, Airflow and Kafka
Period : August 2022 - November 2023
Summary : DTZWTTM5G25M, credly.com, https://www.credly.com/badges/d4b13a2b-2c00-4fc2-be1c-2fe6692feb54/public_url
Issuing Authority : IBM
Show More