👋 Hello, I'm

Girish E

Transforming raw data into strategic decisions — one pipeline at a time.

>> About_Me

I architect highly scalable data ecosystems and digital products. As an Associate Data Engineer, I build Modern Data Stack solutions with Python, Databricks, and Apache Spark. My high-throughput ETL pipelines process 10M+ records daily, and Snowflake query pruning and clustering have cut pipeline latency by up to 40%.

Beyond traditional data warehousing with Airflow and dbt, I build Agentic AI workflows and the products around them, from an Edge Store extension with 20+ features to an AI finance PWA with 25+ daily active users. By combining enterprise data engineering with local LLM APIs and JavaScript, I turn raw data into engaging software.

>> What_I_Do

⚙️

Data Pipeline Engineering

Design and maintain scalable ETL/ELT pipelines using Apache Airflow and Talend that handle millions of records daily.

❄️

Data Warehousing

Architect and optimize data warehouses in Snowflake and other cloud platforms with star/snowflake schema designs.

📊

Analytics & Visualization

Build interactive executive dashboards in Tableau and Power BI that turn complex datasets into clear business narratives.

🤖

ML & Data Science

Apply supervised and unsupervised learning models, NLP techniques, and statistical analysis to drive predictions.

>> Tech_Stack

Modern Data Stack & Warehousing

85%
Snowflake, dbt, PostgreSQL, BigQuery

Programming Languages

80%
Python, SQL, Scala, JavaScript

ML & Agentic AI Workflows

75%
LLM APIs, LangChain, Agentic Patterns, Scikit-Learn

Data Orchestration

70%
Apache Airflow, Airbyte, Fivetran

AI Analytics & BI

75%
Power BI, Tableau, Generative BI

Cloud & MLOps

65%
AWS (S3/EC2), GCP, Docker

>> Experience_Log

Full-Time

Associate Data Engineer

Anvizent Analytics  |  Nov 2023 – Present
  • Partnered directly with enterprise clients to map workflows and engineered custom REST API and webhook connectors for Oracle, NetSuite, and Epicor, reducing integration time by 50%.
  • Owned end-to-end Airflow DAGs on Kubernetes with Docker containers, optimizing resources and increasing pipeline reliability by 40%.
  • Led Databricks Spark processing for 10 million+ daily records, resolving metadata issues and delivering 30% efficiency gains.
  • Built monitoring, alerting, and SLA systems that reduced downtime by 25%, giving clients full visibility into data health.

Internship

Data Engineer Intern

Anvizent Analytics  |  May 2023 – Oct 2023
  • Optimized 500+ Talend, Python, and SQL pipelines for real-time ERP flows, improving throughput by 25% and cutting costs by 15%.
  • Developed Python scripts with retry logic and async processing for ERP API extraction, enabling seamless batch and real-time refreshes.
  • Containerized workflows with Docker and tested Kubernetes deployments for scalable, fault-tolerant operations.
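
The retry-and-async pattern mentioned above can be sketched roughly as follows. This is a minimal illustration using only the Python standard library, not the production code: `fetch_with_retry`, `extract_all`, and `make_flaky_fetcher` are hypothetical names, and the flaky fetcher is a stand-in for a real ERP API call.

```python
import asyncio
import random


async def fetch_with_retry(fetch, *, attempts=3, base_delay=0.01):
    """Await the async callable `fetch`, retrying failures with exponential backoff."""
    for attempt in range(1, attempts + 1):
        try:
            return await fetch()
        except Exception:
            if attempt == attempts:
                raise  # out of retries: surface the last error
            # Wait 1x, 2x, 4x ... the base delay, plus a little random jitter.
            delay = base_delay * 2 ** (attempt - 1) + random.uniform(0, base_delay)
            await asyncio.sleep(delay)


async def extract_all(fetchers, *, concurrency=5):
    """Run many extractions concurrently, capped by a semaphore."""
    gate = asyncio.Semaphore(concurrency)

    async def guarded(fetch):
        async with gate:
            return await fetch_with_retry(fetch)

    # gather() returns results in the same order as the input fetchers.
    return await asyncio.gather(*(guarded(f) for f in fetchers))


def make_flaky_fetcher(name, failures):
    """Hypothetical stand-in for an API call: fails `failures` times, then succeeds."""
    state = {"calls": 0}

    async def fetch():
        state["calls"] += 1
        if state["calls"] <= failures:
            raise ConnectionError(f"{name}: transient error")
        return {"endpoint": name, "calls": state["calls"]}

    return fetch


if __name__ == "__main__":
    fetchers = [make_flaky_fetcher("orders", 0), make_flaky_fetcher("invoices", 2)]
    print(asyncio.run(extract_all(fetchers)))
```

The semaphore keeps the number of in-flight requests bounded, which matters when the upstream API enforces rate limits; the jitter avoids retries from many tasks landing at the same instant.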

>> Education_Log

Bachelor's in Data Science

National College Jayanagar  |  2020 – 2023
  • Studies included Statistics, SQL, Machine Learning, Data Analytics, NLP, and Cloud Computing.
  • Proficient with AWS, Tableau, and Power BI. Final-year project on customer churn prediction.

Science Stream (PCMC)

National College Jayanagar  |  2018 – 2020
  • Physics, Chemistry, Mathematics, and Computer Science with distinction.

>> Featured_Projects

>> Git_Repositories

$ git fetch api.github.com/users/girishdataprofessional/repos

>> Key_Achievements

🧹

CleanTube — Live on Microsoft Edge Store

Solo-built browser extension with 20+ features: full ad blocker, Zen Mode, Gemini AI video summaries, Deep Work Timer, analytics dashboard, SponsorBlock, smart bookmarks, voice-note dictation, and NSFW friction guard. Zero data collection, 100% local.

💰

Smart Finance Manager — 25+ Daily Active Users

AI-powered finance PWA with voice expense entry, intelligent budget alerts, real-time charts, and multi-currency support. It serves a growing base of 25+ daily active users and installs like a native app on any device.

🚀

40% Pipeline Latency Reduction

Optimized ETL workloads via query pruning and Snowflake clustering at Anvizent Analytics.

Airflow Run-Now Feature

Designed and shipped a bespoke Airflow plugin that eliminated manual pipeline trigger delays across 50+ production DAGs.

📊

C-Suite Dashboard Delivery

Built executive-facing Tableau dashboards consumed by leadership to drive strategic decisions.

🎓

BSc Data Science — Distinction

Graduated with distinction from National College Jayanagar with a final project on churn prediction.

>> Certifications