I am a Certified Data Engineer with over 2 years of experience in crafting and optimizing scalable ETL processes and data pipelines. My proficiency in Python, SQL, Spark, and cloud platforms like Azure and AWS allows me to develop robust data solutions that drive efficiency and operational excellence. I excel in designing and deploying ETL pipelines, utilizing big data technologies such as PySpark and Hadoop, and automating workflows to streamline operations and cut costs.
Beyond engineering, I possess a strong background in data analytics. I leverage statistical analysis and machine learning to extract actionable insights and support strategic decision-making. Whether it's performing regression analysis or building predictive models, I transform complex data into valuable business insights that propel growth and enhance performance.
✨ Always ready for a data-driven adventure and excited to tackle the next big challenge!
0 + Projects completed
Developed a real-time social media listening system with Kafka, analyzing Reddit data using NLP for sentiment, entities, and topics.
Built an AWS Glue ETL pipeline with S3 storage, applying regression, random forest, and XGBoost for price prediction. Developed 5+ QuickSight dashboards for data-driven insights.
Developed a Python-based ML model with Scikit-Learn to detect and predict IoT network anomalies, achieving 95% accuracy.
Developed a computer vision surveillance system using YOLOv5 to identify riders and license plates, targeting helmet violations and triple riding, with 95% accuracy on 20GB of video data
Designed and developed OLTP/OLAP systems, optimizing SQL queries and reducing execution time by 40%. Structured databases using EER and UML. Conducted EDA and created Tableau visualizations for key metrics.
Analyzed developer perspectives on AI using Stack Overflow's 2023 Developer Survey, visualizing insights with Tableau and D3.js.
Data Analytics Professional with over 2 years of experience in developing data-driven insights, optimizing data processes, and building predictive models to support strategic decision-making. Proficient in Python, SQL, Excel, Power BI, and cloud platforms (Azure, AWS), with a strong background in data visualization, statistical analysis, and automating data workflows to enhance business performance.
Python, SQL, JavaScript, C.
Data EngineeringDatabricks, Azure Data Factory, Azure Synapse, Hadoop, Airflow, Kafka, AWS Glue, AWS S3, Dbt.
Data Analytics ToolsTableau, Power BI, Looker, MS Excel, Streamlit, Jupyter Notebooks.
MySQL, PostgreSQL, MongoDB (NoSQL), Neo4j (Graph DB).
FrameworksFlask, PySpark, Pytorch, TensorFlow, Keras, Sci-kit Learn, Numpy, Pandas, Matplotlib.
Machine LearningRegression, Classification, Clustering, Time Series Analysis, Hypothesis Testing, A/B Testing, NLP.
San Jose, CA 95113
shrinivas.bhusannavar@sjsu.edu
shrinivasab97@gmail.com