Raman Srivastava — Senior DevOps & Cloud Engineer

// 01 · Experience

Where I've Built

AssetMark · Senior DevOps / Cloud Engineer

FEB 2024 — PRESENT

$17B wealth management platform · United States

Architected enterprise AKS platform using Terraform & GitOps (ArgoCD). Eliminated configuration drift across 15+ engineering teams; reduced cluster provisioning from hours to under 15 minutes with modular Terraform and self-service infrastructure.

Reduced deployment time by 45% via multi-stage CI/CD pipelines (GitHub Actions, Azure Pipelines) with progressive delivery — canary, blue-green, and A/B deployments via Argo Rollouts. Achieved 98%+ release success rate across 40+ microservices with automated rollback on failure.

Cut security vulnerabilities by 40% embedding Checkmarx (SAST), SonarQube, Veracode (DAST), and OPA/Kyverno (policy-as-code) into CI/CD. Implemented zero-trust Istio mTLS, automated Key Vault secret rotation, and Azure Policy for compliance (SOC 2, PCI-DSS).

Sustained 99.99% platform uptime with Dynatrace, Prometheus, Grafana, ELK Stack, and OpenTelemetry. Established SLO/SLI error-budget policies, reduced MTTR by 40%. Built chaos engineering framework (LitmusChaos) for pre-release resilience validation.

Engineered zero-downtime cluster upgrades with HPA/KEDA autoscaling, optimized node pools, and taints/tolerations. Reduced monthly compute costs by 18% (~$2M annual savings) through right-sizing, spot instances, and FinOps dashboards.

Architected enterprise event-driven platform on AKS using Apache Kafka for real-time data streaming. Tuned Kafka clusters (partition optimization, consumer group autoscaling) achieving 35% throughput increase and 20% latency reduction.

Automated PostgreSQL provisioning, schema migrations (Alembic), backup validation, and failover testing via Terraform. Built Python/Shell automation for drift detection, cost analysis, and resource cleanup, saving 6+ engineering hours weekly.

Delivered AI-powered exception analysis platform (RemediAI) — LangGraph AI agents, Azure Service Bus, PostgreSQL, FastAPI, React dashboard. Reduced .NET exception debugging time via intelligent stack trace analysis with LLM-based remediation suggestions.

Azure AKSTerraformArgoCDIstioKafkaPrometheusGrafanaAzure DevOpsGitHub ActionsPythonPostgreSQLDynatraceCheckmarxKEDALangGraph

Genpact · DevOps / Cloud Engineer

FEB 2019 — JAN 2022

Pharmaceutical enterprise · India

Led full legacy-to-AWS cloud migration of pharmaceutical data platform — migrated 50+ applications using Amazon EMR, AWS Glue, Redshift, S3, and VPC. Reduced operational costs by 40% while scaling to 150M+ records/week at 98% data accuracy.

Architected containerized big data platform on Amazon EKS with auto-scaling, multi-AZ HA, and disaster recovery for pharmaceutical analytics. Designed scalable ingestion pipelines from 80+ data sources processing 100+ TB into S3 data lake and Redshift warehouse.

Built 80+ automated data quality checks using Python, AWS Lambda, and AWS Glue — null detection, sign validation, schema validation, outlier detection. Prevented $2M+ in quarterly losses from bad incentive payouts.

Implemented end-to-end CI/CD pipelines with GitLab CI, GitHub Actions, and Jenkins for automated build/test/deploy of data workflows. Cut release cycles by 40%, increased deployment frequency 3x across teams.

Enhanced pipeline observability with CloudWatch dashboards, SNS alerting, custom log aggregation, and Glue job metrics. Automated SSIS-based ETL workloads using SQL Agent, Jenkins, and containerized execution on EC2/ECS, improving team productivity by 15%.

Led L2/L3 on-call production support for AWS workloads: incident logging, root cause analysis, hot-fix coordination, and SLA-driven recovery for pharmaceutical analytics serving 15+ business teams.

AWS EKSAWS GlueEMRRedshiftS3TerraformGitLab CIJenkinsPythonApache SparkCloudWatchLambda

// 03 · Skills

Technologies I Work With

☁️ Cloud Platforms

Azure AKSAWS EKSGCP GKEAzure DevOpsAzure MonitorAWS LambdaRedshiftKey VaultAPIMCosmos DBCloudFrontAzure Service Operator

⚙️ Infrastructure as Code

TerraformHelmArgoCDAnsibleARM TemplatesBicepCloudFormationKustomizeTerragrunt

🚀 CI/CD & GitOps

GitHub ActionsGitLab CIJenkinsAzure PipelinesArgo RolloutsFluxCanaryBlue-GreenSpinnaker

🔒 DevSecOps

CheckmarxSonarQubeVeracodeSnykTrivyOPAKyvernoIstio mTLSZero TrustAzure PolicySOC 2

📊 Observability & SRE

PrometheusGrafanaDynatraceELK StackOpenTelemetryJaegerTempoCloudWatchPagerDutySLO/SLIChaos Engineering

🗄️ Data & Streaming

Apache KafkaDatabricksPySparkAWS GlueEMRAirflowKinesisAthenaRedshift

💻 Languages

PythonBashPowerShellDotnetSQLYAMLHCL

🗃️ Databases

PostgreSQLRedisCosmos DBDynamoDBRDSAzure SQLElasticsearch

Raman
Srivastava

Where I've Built

Open Source & Side Projects

Technologies I Work With

☁️ Cloud Platforms

⚙️ Infrastructure as Code

🚀 CI/CD & GitOps

🔒 DevSecOps

📊 Observability & SRE

🗄️ Data & Streaming

💻 Languages

🗃️ Databases

Verified Credentials

Let's build
something reliable.

RamanSrivastava

Where I've Built

Open Source & Side Projects

Technologies I Work With

☁️ Cloud Platforms

⚙️ Infrastructure as Code

🚀 CI/CD & GitOps

🔒 DevSecOps

📊 Observability & SRE

🗄️ Data & Streaming

💻 Languages

🗃️ Databases

Verified Credentials

Let's buildsomething reliable.

Raman
Srivastava

Let's build
something reliable.