Lokeswar Kudumula

Data Engineer & Platform Architect

Building enterprise-grade blockchain data pipelines. Specializing in ETL systems, Snowflake data warehousing, and production Kubernetes deployments.

Remote | Joined Aug 2025

Experience

Data Engineer & Platform Architect

MugenLink Network

Aug 2025 - Present | Remote

Enterprise-grade blockchain data platform processing multi-chain cryptocurrency analytics.

  • Architected a production ETL pipeline processing 17GB+ of data across 6 cryptocurrencies (BTC, BCH, DOGE, LTC, DASH, ZEC) using a custom ScriptRunner CLI framework.
  • Built 3-tier Snowflake Data Warehouse (RAW → STAGING → ANALYTICS) with 13 RBAC roles, RSA key-pair authentication, and 8 specialized service accounts.
  • Developed 30 dbt models with incremental materialization, achieving 20-40% faster builds using a merge strategy and Polars-based schema inference (3-5x faster than pandas).
  • Implemented 8-stage CI/CD pipeline with GitHub Actions: lint, security scan (Trivy/Bandit), 70%+ test coverage, container scanning, and SonarQube integration.
  • Deployed on Kubernetes (k3s) with Helm charts, Linkerd service mesh (zero-trust), HPA auto-scaling, and Doppler secrets management.
  • Built an async FastAPI backend serving real-time analytics, with Redis caching, OpenTelemetry distributed tracing, a circuit breaker pattern, and rate limiting.
Python
Snowflake
dbt
Airflow
FastAPI
Kubernetes
Kafka
Redis
AWS S3
GitHub Actions

System Engineer

Infosys Ltd. (Proximus)

Aug 2021 - Mar 2023 | Trivandrum, India

Telecom infrastructure management for 4M+ subscribers.

  • Automated server health monitoring with Ansible playbooks and Bash scripts, reducing manual checks by 87%.
  • Reduced incident resolution time by 40% by implementing the ELK Stack for centralized log analysis and anomaly detection.
  • Optimized SQL queries through query tuning and indexing strategies, improving identity data validation speed by 60%.
  • Managed CI/CD pipelines for 10+ microservices on OpenShift with Jenkins, achieving 99.5% deployment success rate.
Ansible
ELK Stack
SQL
OpenShift
Jenkins
Dynatrace
Bash
Linux

Projects

MugenLink Network

Featured

Enterprise-grade blockchain data platform with full ETL pipeline, real-time analytics, and production Kubernetes deployment.

  • Complete data pipeline: Blockchair API → S3 (AES-256 encrypted) → Snowflake with 1MB chunked downloads, 4-worker concurrency, and checkpoint/resume support.
  • 30 dbt analytics models across 5 layers (staged, core, intermediate, marts, analysis) enabling wallet transaction tracing with running balance calculations.
  • Production Kubernetes deployment with Helm charts, Linkerd zero-trust networking, Prometheus/Grafana monitoring, and automated HPA scaling.
  • 8-stage CI/CD quality gates: formatting, security scanning, 70%+ test coverage, container vulnerability scanning, and SonarQube analysis.
Snowflake
dbt
Airflow
FastAPI
Kubernetes
Kafka
Redis
OpenTelemetry
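
The chunked, resumable download described for this pipeline can be illustrated with a minimal sketch. It is not the project's actual code: the function names, the in-memory `completed` set standing in for a persisted checkpoint, and the `fetch` callable are all hypothetical, and only the 1MB chunk size and 4-worker concurrency come from the description above.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed

CHUNK_SIZE = 1024 * 1024  # 1 MiB chunks, as in the description above


def plan_chunks(total_bytes, chunk_size=CHUNK_SIZE):
    """Return inclusive (start, end) byte ranges covering total_bytes."""
    return [
        (start, min(start + chunk_size, total_bytes) - 1)
        for start in range(0, total_bytes, chunk_size)
    ]


def pending_chunks(total_bytes, completed, chunk_size=CHUNK_SIZE):
    """Skip ranges whose start offset is already recorded in the checkpoint."""
    return [r for r in plan_chunks(total_bytes, chunk_size) if r[0] not in completed]


def download(total_bytes, fetch, completed, workers=4, chunk_size=CHUNK_SIZE):
    """Fetch every pending chunk concurrently and record each success.

    fetch(start, end) is a hypothetical callable that downloads one byte range.
    completed is a set of finished start offsets (the checkpoint); a real
    pipeline would persist it so an interrupted run can resume.
    """
    todo = pending_chunks(total_bytes, completed, chunk_size)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        futures = {pool.submit(fetch, start, end): start for start, end in todo}
        for future in as_completed(futures):
            future.result()  # re-raise any fetch error
            completed.add(futures[future])  # checkpoint only after success
    return completed
```

Because the checkpoint is updated only after a chunk succeeds, rerunning `download` with the same `completed` set retries just the chunks that are still missing.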

Blockchain Analytics Dashboard

Real-time cryptocurrency analytics dashboard with wallet tracing, portfolio tracking, and network exploration.

  • Built with Nuxt 3 and Vue 3 featuring Bento Grid layout, real-time WebSocket updates, and mobile-responsive design.
  • Portfolio tracker with wallet-centric transaction tracing and fund flow visualization across 6 blockchain networks.
Nuxt.js
Vue 3
Tailwind CSS
FastAPI
WebSocket

Distributed ETL Framework

Reusable Python library for building production ETL pipelines with built-in observability and resilience patterns.

  • 5,607 lines of production CLI code with ScriptRunner framework, standardized error handling, and comprehensive logging.
  • Features include circuit breaker pattern, thread-safe rate limiting, checkpoint/resume support, and W3C Trace Context propagation.
Python
OpenTelemetry
Circuit Breaker
Async/Await
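
As an illustration of the circuit breaker pattern named above, here is a minimal sketch. It is not the library's implementation: the class name, parameters, and defaults are assumptions, and this simplified version is single-threaded, so a production variant would need locking.

```python
import time


class CircuitOpenError(RuntimeError):
    """Raised when a call is rejected because the breaker is open."""


class CircuitBreaker:
    """Hypothetical breaker: opens after N consecutive failures."""

    def __init__(self, failure_threshold=3, reset_timeout=30.0, clock=time.monotonic):
        self.failure_threshold = failure_threshold
        self.reset_timeout = reset_timeout
        self._clock = clock  # injectable so the timeout can be tested
        self._failures = 0
        self._opened_at = None  # None means the breaker is closed

    def call(self, func, *args, **kwargs):
        if self._opened_at is not None:
            if self._clock() - self._opened_at < self.reset_timeout:
                raise CircuitOpenError("circuit open; call rejected")
            # Timeout elapsed: half-open, let one trial call through.
        try:
            result = func(*args, **kwargs)
        except Exception:
            self._failures += 1
            if self._opened_at is not None or self._failures >= self.failure_threshold:
                self._opened_at = self._clock()  # open (or re-open) the breaker
            raise
        # A success closes the breaker and clears the failure count.
        self._failures = 0
        self._opened_at = None
        return result
```

Wrapping an upstream request as `breaker.call(fetch_page, url)` (with `fetch_page` being any callable) fails fast while the breaker is open instead of repeatedly hitting a failing service.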

Data Quality Framework

Automated data validation integrated with Airflow DAGs and dbt tests for ensuring pipeline reliability.

  • Custom validators for schema enforcement, data completeness, and referential integrity across 6 cryptocurrency datasets.
  • SLA monitoring with 4-10h phase thresholds, automated alerting, and recovery handlers for failed tasks.
Python
dbt
Great Expectations
Airflow
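
The three validator types listed above (schema enforcement, completeness, referential integrity) can be sketched in plain Python. This is only an illustration under assumed names: the functions and the column names in the usage example are hypothetical, and the actual framework builds on dbt tests and Great Expectations rather than this code.

```python
def check_schema(rows, schema):
    """Return (row_index, column) pairs whose value has an unexpected type."""
    return [
        (i, col)
        for i, row in enumerate(rows)
        for col, expected in schema.items()
        if row.get(col) is not None and not isinstance(row[col], expected)
    ]


def check_completeness(rows, required):
    """Return indexes of rows missing (or holding None for) a required column."""
    return [
        i for i, row in enumerate(rows)
        if any(row.get(col) is None for col in required)
    ]


def check_referential_integrity(children, parents, fk, pk):
    """Return child foreign-key values that have no matching parent key."""
    known = {parent[pk] for parent in parents}
    return sorted({child[fk] for child in children if child[fk] not in known})


# Hypothetical sample data: two blocks and two transactions.
blocks = [{"id": 1}, {"id": 2}]
txs = [
    {"tx_hash": "a", "block_id": 1, "value": 5},
    {"tx_hash": None, "block_id": 3, "value": "x"},
]

schema_errors = check_schema(txs, {"tx_hash": str, "value": int})
incomplete = check_completeness(txs, ["tx_hash", "block_id"])
orphans = check_referential_integrity(txs, blocks, "block_id", "id")
```

Each check returns the offending rows or keys rather than raising, so a caller such as an Airflow task can decide whether a violation should fail the run or only trigger an alert.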

Skills

Data Engineering

ETL/ELT Pipelines
Apache Airflow
dbt
Snowflake
Data Modeling
Apache Kafka
Data Warehousing
Schema Design

Programming

Python
SQL
Bash
FastAPI
Async/Await
Pydantic
Polars
pytest

Cloud & DevOps

AWS S3
Kubernetes
Docker
Helm
GitHub Actions
Terraform
CI/CD
Linux

Observability

OpenTelemetry
Prometheus
Grafana
ELK Stack
Distributed Tracing
Structured Logging

Security & Quality

RBAC
RSA Auth
Trivy
Bandit
SonarQube
Zero-Trust

Tools

Git
Redis
PostgreSQL
Doppler
Linkerd
n8n

Education

Master of Professional Studies

Data Science

University of Maryland, Baltimore County

Baltimore, MD | May 2025

Bachelor of Technology

Mechanical Engineering

Sree Vidyanikethan Engineering College

Tirupati, India | June 2021

About Me

Passionate data engineer with expertise in building scalable, production-grade data platforms. I specialize in designing end-to-end ETL pipelines, implementing data warehouse architectures, and deploying resilient distributed systems on Kubernetes.

My work on MugenLink Network involves processing multi-chain blockchain data across 6 cryptocurrencies, using Snowflake for analytics, dbt for transformation, and Airflow for orchestration, all running on a zero-trust Kubernetes cluster with comprehensive observability.

Get in Touch

Interested in collaborating or have questions about the platform? Feel free to reach out.