Immediate Joiner

utkarsh@bajpai:~$_whoami

Utkarsh
Bajpai

›

Gurugram, Haryana

I build scalable infrastructure and AI-powered systems that run at scale. Passionate about developer experience, cloud-native architectures, and open-source.

View Projects Resume Contact me →

scroll

// experience

Where I've Worked

3.5+ years building cloud-native infrastructure, Kubernetes platforms, and Java microservices in production at scale.

Working at Incedo Inc. on-site for Feedzai — a market leader in AI-driven financial fraud prevention protecting the world's largest banks through real-time risk management and AML solutions. Domain expertise in AML and Transaction Fraud for Banking (TFB) across transaction processing, event enrichment, and multi-tenancy for key global clients.

Key Achievements

▸Led and mentored a team of 3 junior engineers — owning the full journey from sprint planning and development to deployment and production release with on-time delivery
▸Led the migration from standalone Docker to Kubernetes using Helm Charts and Operators, shifting configs from Ansible to ConfigMaps/Secrets with GitLab CI pipelines
▸Owned 40+ Blue-Green and rolling deployments over 3 years with a 97%+ success rate, ensuring zero-downtime releases with proactive client communication
▸Implemented Java microservices for database operations, webhook integrations, and REST API configurations — reducing delete operation effort for reference data entities by 80%
▸Performed RCA on 10+ production-critical OOM, disk exhaustion, slowness, log rotation, and PostgreSQL indexing incidents, communicating findings to stakeholders
▸Monitored and maintained ETL pipelines ensuring reliable data transfer to AWS S3, resolving failures through log analysis and cron-job debugging
▸Designed and maintained Liquibase changesets for database schema and index optimisation across PostgreSQL in a multi-tenant environment with JWT-based auth flows

Tech Stack

KubernetesHelmDockerGitLab CIJavaAWS EKSAWS S3AWS CloudWatchPostgreSQLDynamoDBRabbitMQKafkaPrometheusGrafanaLokiTerraformAnsibleLiquibaseBashLinux

// projects

Things I've Built

Production systems and engineering initiatives from 3.5+ years at Feedzai's fraud detection platform.

Featured

DevOps

Kubernetes Blue-Green Deployment Pipeline

End-to-end Blue-Green and rolling deployment pipeline for a production AI fraud detection platform on AWS EKS. Achieved 97%+ success rate across 40+ releases with zero downtime and full rollback ownership.

KubernetesHelmGitLab CIAWS EKSPrometheus+2

active

Featured

DevOps

Docker → Kubernetes Migration

Led the full production migration from standalone Docker to Kubernetes using Helm Charts and Kubernetes Operators. Shifted config management from Ansible playbooks to ConfigMaps/Secrets with GitLab CI automation — zero production disruption.

KubernetesHelmDockerAnsibleGitLab CI+2

active

Featured

Backend

Java Microservices — REST API & Webhook Platform

Implemented Java microservice features for a multi-tenant financial fraud detection system: database operations, webhook integrations, REST API configurations, and JWT-based authentication. Reduced reference data delete effort by 80%.

JavaREST APIsPostgreSQLDynamoDBLiquibase+2

active

DevOps

Observability Stack — Prometheus, Grafana & Loki

Set up and maintained the full observability stack for production Kubernetes workloads — metrics with Prometheus, dashboards with Grafana, and log aggregation with Loki for proactive incident detection and RCA.

PrometheusGrafanaLokiKubernetesAWS CloudWatch

active

Cloud

ETL Pipeline Monitoring & Reliability

Monitored and maintained ETL pipelines for reliable data transfer to AWS S3. Resolved failures and data inconsistencies through log analysis, cron-job debugging, and root cause analysis with CloudWatch.

AWS S3AWS CloudWatchBashLinuxPostgreSQL+1

active

Tools

Production Incident RCA Framework

Systematic approach to analysing and resolving production-critical incidents — OOM kills, disk exhaustion, ETL failures, log rotation, and PostgreSQL indexing issues — using Linux tooling, grep, awk, ps, and curl.

LinuxBashgrep/awk/sedPostgreSQLPrometheus+2

active

// skills

Technical Arsenal

Tools and technologies I use to ship production systems — from infrastructure to intelligent applications.

KubernetesExpert

DockerExpert

TerraformExpert

ArgoCDAdvanced

HelmAdvanced

GitHub ActionsExpert

// certifications

Certifications

Industry-recognised credentials across cloud, Kubernetes, and AI — with two more in progress.

AWS Certified Cloud Practitioner

Amazon Web Services

Jan 2024

Certified Kubernetes Administrator (CKA)

In Progress

CNCF / Linux Foundation

Expected Jun 2025

AWS Certified Solutions Architect – Associate

In Progress

Amazon Web Services

Expected Aug 2025

Anthropic Claude 101

Anthropic

Jan 2024

Anthropic Claude Code in Action

Anthropic

Jan 2024

Anthropic AI Fluency: Framework & Foundations

Anthropic

Jan 2024

Mastering Linux: The Comprehensive Guide

Online Course

Jan 2023

// achievements

Awards & Impact

Recognition, measurable outcomes, and milestones from 3.5+ years in production engineering.

Made a Difference Award (×2)

Received twice at Incedo Inc. for outstanding performance and direct recognition from the client Feedzai for exceptional contributions across deployment ownership, incident management, and feature delivery.

Jan 2024

97%+ Deployment Success Rate — 40+ Releases

Owned 40+ Blue-Green and rolling Kubernetes deployments over 3 years with a 97%+ success rate. Ensured zero-downtime releases with proactive client communication and full rollback ownership.

Jan 2024

80% Reduction in Delete Operation Effort

Implemented Java microservice functionality enabling client-owned execution flows, reducing reference data entity delete operation effort by 80% through optimised REST API design.

Jan 2023

Led Docker → Kubernetes Migration

Drove the full migration from standalone Docker to Kubernetes using Helm Charts and Operators. Shifted configuration management from Ansible to ConfigMaps/Secrets with GitLab CI automation — zero production disruption.

Jan 2023

RCA & Resolution of 10+ Critical Production Incidents

Analysed and resolved 10+ production-critical incidents including OOM kills, disk exhaustion, ETL pipeline failures, log rotation issues, and PostgreSQL indexing degradation. Communicated findings to stakeholders.

Jan 2024

Team Lead — Mentored 3 Junior Engineers

Led and mentored a team of 3 junior engineers, owning the full engineering lifecycle from sprint planning and development through to deployment, production release, and client communication.

Jan 2023

B.Tech CGPA 9.1 — SRM Institute of Science & Technology

Graduated with a CGPA of 9.1 in Computer Science Engineering from SRM Institute of Science and Technology, Chennai (2018–2022).

Jun 2022

// writing

Technical Writing

Deep-dives, war stories, and lessons from building production systems.

Featured

Zero-Downtime Database Migrations at Scale

How we migrated 2TB of PostgreSQL data across 50+ microservices without a single second of downtime. The tooling, the playbook, and the lessons learned.

PostgreSQLDevOpsArchitecture

Nov 2023·12 min

Read

Featured

Building a Production Kubernetes Platform in 2024

A complete guide to designing, deploying, and operating a Kubernetes platform for 50+ engineering teams. Covers GitOps, observability, cost management, and developer experience.

KubernetesPlatform EngineeringDevOps

Sep 2023·18 min

Read

LLMs in Production: What Nobody Tells You

After running LLM-powered features for 200K daily users for 6 months, here's what I wish I knew about latency, cost, caching, and failure modes.

AI/MLBackendProduction

Aug 2023·15 min

Read

Kafka vs. RabbitMQ vs. Pulsar — A Production Comparison

We ran all three in production for 18 months. Here's the honest comparison: throughput, operational overhead, ecosystem maturity, and when to use each.

KafkaMessagingArchitecture

Jun 2023·10 min

Read

// contact

Get In Touch

Open to new opportunities, interesting projects, and good conversations.

contact.sh

$ echo $EMAIL

utkarshbajpai44@gmail.dev

$ echo $LOCATION

Gurugram, Haryana

$ echo $STATUS

✓ Open to new roles

utkarshbajpai44@gmail.dev

linkedin.com/in/helloutkarsh

github.com/helloutkarsh

UtkarshBajpai

Where I've Worked

Software Engineer

Key Achievements

Tech Stack

Things I've Built

Kubernetes Blue-Green Deployment Pipeline

Docker → Kubernetes Migration

Java Microservices — REST API & Webhook Platform

Observability Stack — Prometheus, Grafana & Loki

ETL Pipeline Monitoring & Reliability

Production Incident RCA Framework

Technical Arsenal

Certifications

AWS Certified Cloud Practitioner

Certified Kubernetes Administrator (CKA)

AWS Certified Solutions Architect – Associate

Anthropic Claude 101

Anthropic Claude Code in Action

Anthropic AI Fluency: Framework & Foundations

Mastering Linux: The Comprehensive Guide

Awards & Impact

Made a Difference Award (×2)

97%+ Deployment Success Rate — 40+ Releases

80% Reduction in Delete Operation Effort

Led Docker → Kubernetes Migration

RCA & Resolution of 10+ Critical Production Incidents

Team Lead — Mentored 3 Junior Engineers

B.Tech CGPA 9.1 — SRM Institute of Science & Technology

Technical Writing

Zero-Downtime Database Migrations at Scale

Building a Production Kubernetes Platform in 2024

LLMs in Production: What Nobody Tells You

Kafka vs. RabbitMQ vs. Pulsar — A Production Comparison

Get In Touch

Where I've Worked

Software Engineer

Key Achievements

Tech Stack

Things I've Built

Kubernetes Blue-Green Deployment Pipeline

Docker → Kubernetes Migration

Java Microservices — REST API & Webhook Platform

Observability Stack — Prometheus, Grafana & Loki

ETL Pipeline Monitoring & Reliability

Production Incident RCA Framework

Technical Arsenal

Certifications

AWS Certified Cloud Practitioner

Certified Kubernetes Administrator (CKA)

AWS Certified Solutions Architect – Associate

Anthropic Claude 101

Anthropic Claude Code in Action

Anthropic AI Fluency: Framework & Foundations

Mastering Linux: The Comprehensive Guide

Awards & Impact

Made a Difference Award (×2)

97%+ Deployment Success Rate — 40+ Releases

80% Reduction in Delete Operation Effort

Led Docker → Kubernetes Migration

RCA & Resolution of 10+ Critical Production Incidents

Team Lead — Mentored 3 Junior Engineers

B.Tech CGPA 9.1 — SRM Institute of Science & Technology

Technical Writing

Zero-Downtime Database Migrations at Scale

Building a Production Kubernetes Platform in 2024

LLMs in Production: What Nobody Tells You

Kafka vs. RabbitMQ vs. Pulsar — A Production Comparison

Get In Touch

Utkarsh
Bajpai