Skip to content
View charan3129's full-sized avatar
  • 20:42 (UTC -04:00)

Block or report charan3129

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
charan3129/README.md

Profile Views


Hi, I'm Sai Charan!

Typing SVG


👨‍💻 About Me

  • 🏢 Currently building cloud-native data pipelines at Walmart, processing 2TB+ daily
  • 🏗️ Designing enterprise data lakes on ADLS Gen2 integrating 10+ retail systems
  • ⚡ Migrated legacy ETL to event-driven pipelines using Kafka, increasing throughput by 40%
  • 🎓 M.S. Data Science from UMass Dartmouth
  • 🔍 Passionate about data quality, automation, and clean architecture
  • 📚 Always learning — currently exploring advanced dbt patterns and Terraform

🛠️ Tech Stack

☁️ Cloud & Warehousing

Azure Azure Databricks Azure Synapse ADLS Gen2 Snowflake AWS S3 AWS Redshift Google BigQuery

🔥 Big Data & Streaming

Apache Spark PySpark Apache Kafka Event Hubs Hive HDFS

💻 Languages & Scripting

Python SQL Spark SQL Bash

⚙️ Orchestration & DevOps

Apache Airflow dbt Docker Terraform Git GitHub Actions Jenkins

📊 Visualization

Power BI Tableau Looker


🚀 Featured Projects

End-to-end real-time data pipeline for retail analytics with streaming ingestion, quality checks, and interactive dashboards.

Kafka PySpark Delta Lake Snowflake dbt Airflow Great Expectations Streamlit


Scalable healthcare data pipeline with automated quality validation, cloud storage integration, and real-time monitoring dashboards.

Python Snowflake dbt Airflow Great Expectations AWS S3 Streamlit


Production-grade dbt project implementing Kimball star schema with 35+ data quality tests and full CI/CD automation on Snowflake.

dbt Snowflake Kimball Tests CI/CD


📈 GitHub Stats

GitHub Streak


🏆 GitHub Trophies

trophy


🤝 Connect With Me

LinkedIn Gmail GitHub

💡 Open to collaborating on data engineering projects and cloud data platform initiatives

Popular repositories Loading

  1. charan3129 charan3129 Public

    Config files for my GitHub profile.

    1

  2. dbt-Analytics-Engineering-Project dbt-Analytics-Engineering-Project Public

    End-to-end analytics engineering project using dbt and Snowflake. Transforms raw e-commerce data through staging, intermediate, and marts layers into a Kimball star schema with fact and dimension t…

    1

  3. Real_Time_Retail_Pipeline Real_Time_Retail_Pipeline Public

    End-to-end real-time retail data pipeline using Kafka, PySpark, Delta Lake, Snowflake, dbt, Airflow, Great Expectations, and Streamlit — featuring medallion architecture (Bronze/Silver/Gold), Kimba…

    Python 1

  4. charan3129.github.io charan3129.github.io Public

    Github.io

    CSS 1

  5. Healthcare-Data-Pipeline Healthcare-Data-Pipeline Public

    End-to-end healthcare analytics pipeline ingesting CMS Medicare and openFDA data via Python APIs, staging through AWS S3, loading into Snowflake, transforming with dbt (Kimball star schema), valida…

    Python 1

  6. Healthcare_Analytics_Drug_Safety Healthcare_Analytics_Drug_Safety Public

    Azure-native healthcare analytics platform ingesting CMS Medicare and openFDA drug safety data via ADF into ADLS Gen2, loading into Snowflake, transforming with dbt into star schema, validating wit…

    Python 1