Experience
Data Engineer & Platform Architect
MugenLink Network
Enterprise-grade blockchain data platform processing multi-chain cryptocurrency analytics.
- Architected production ETL pipeline processing 6 cryptocurrencies (BTC, BCH, DOGE, LTC, DASH, ZEC) with 17GB+ data using custom ScriptRunner CLI framework.
- Built 3-tier Snowflake Data Warehouse (RAW → STAGING → ANALYTICS) with 13 RBAC roles, RSA key-pair authentication, and 8 specialized service accounts.
- Developed 30 dbt models with incremental materialization achieving 20-40% faster builds using merge strategy and Polars-based schema inference (3-5x faster than pandas).
- Implemented 8-stage CI/CD pipeline with GitHub Actions: lint, security scan (Trivy/Bandit), 70%+ test coverage, container scanning, and SonarQube integration.
- Deployed on Kubernetes (k3s) with Helm charts, Linkerd service mesh (zero-trust), HPA auto-scaling, and Doppler secrets management.
- Built async FastAPI backend with Redis caching, OpenTelemetry distributed tracing, circuit breaker pattern, and rate limiting serving real-time analytics.
System Engineer
Infosys Ltd. (Proximus)
Telecom infrastructure management for 4M+ subscribers.
- Automated server health monitoring reducing manual checks by 87% using Ansible playbooks and Bash scripts.
- Reduced incident resolution time by 40% implementing ELK Stack for centralized log analysis and anomaly detection.
- Optimized SQL queries improving identity data validation speed by 60% through query tuning and indexing strategies.
- Managed CI/CD pipelines for 10+ microservices on OpenShift with Jenkins, achieving 99.5% deployment success rate.
Projects
MugenLink Network
Enterprise-grade blockchain data platform with full ETL pipeline, real-time analytics, and production Kubernetes deployment.
- Complete data pipeline: Blockchair API → S3 (AES-256 encrypted) → Snowflake with 1MB chunked downloads, 4-worker concurrency, and checkpoint/resume support.
- 30 dbt analytics models across 5 layers (staged, core, intermediate, marts, analysis) enabling wallet transaction tracing with running balance calculations.
- Production Kubernetes deployment with Helm charts, Linkerd zero-trust networking, Prometheus/Grafana monitoring, and automated HPA scaling.
- 8-stage CI/CD quality gates: formatting, security scanning, 70%+ test coverage, container vulnerability scanning, and SonarQube analysis.
Blockchain Analytics Dashboard
Real-time cryptocurrency analytics dashboard with wallet tracing, portfolio tracking, and network exploration.
- Built with Nuxt 3 and Vue 3 featuring Bento Grid layout, real-time WebSocket updates, and mobile-responsive design.
- Portfolio tracker with wallet-centric transaction tracing and fund flow visualization across 6 blockchain networks.
Distributed ETL Framework
Reusable Python library for building production ETL pipelines with built-in observability and resilience patterns.
- 5,607 lines of production CLI code with ScriptRunner framework, standardized error handling, and comprehensive logging.
- Features include circuit breaker pattern, thread-safe rate limiting, checkpoint/resume support, and W3C Trace Context propagation.
Data Quality Framework
Automated data validation integrated with Airflow DAGs and dbt tests for ensuring pipeline reliability.
- Custom validators for schema enforcement, data completeness, and referential integrity across 6 cryptocurrency datasets.
- SLA monitoring with 4-10h phase thresholds, automated alerting, and recovery handlers for failed tasks.
Skills
Data Engineering
Programming
Cloud & DevOps
Observability
Security & Quality
Tools
Education
Master of Professional Studies
Data Science
University of Maryland, Baltimore County
Bachelor of Technology
Mechanical Engineering
Sree Vidyanikethan Engineering College
About Me
Passionate data engineer with expertise in building scalable, production-grade data platforms. I specialize in designing end-to-end ETL pipelines, implementing data warehouse architectures, and deploying resilient distributed systems on Kubernetes.
My work on MugenLink Network involves processing multi-chain blockchain data across 6 cryptocurrencies, leveraging Snowflake for analytics, dbt for transformation, and Airflow for orchestration — all running on a zero-trust Kubernetes cluster with comprehensive observability.