Work History
Download CV as PDFAmit Kumar Biswas
London, United Kingdom · amitkbiswas01@gmail.com · linkedin.com/in/amitkbiswas01
London, United Kingdom · +447960177271 · amitkbiswas01@gmail.com · linkedin.com/in/amitkbiswas01
Professional Summary
AWS-certified Data Engineer experienced in building production-grade data platforms on AWS. Expertise in Python, SQL, and serverless architecture for ETL/ELT pipelines, real-time data ingestion and streaming, and analytics-ready data modelling. Hands-on experience delivering GenAI solutions with Amazon Bedrock, including RAG pipelines, LLM evaluation, and AI-powered automation. Comfortable owning the full data platform lifecycle from architecture through deployment and monitoring.
Certifications
AWS Certified Data Engineer – Associate — Amazon Web Services (AWS)Technical Expertise
- Data Engineering:
- ETL/ELT pipelines (batch & streaming), Apache Kafka (AWS MSK), event-driven architecture, data modelling (Star Schema, Dimensional, Medallion), data quality monitoring & validation, schema design & query optimization, REST API ingestion
- GenAI & ML:
- Amazon Bedrock (Nova, Titan), RAG pipelines, LLM evaluation & benchmarking, prompt engineering, Amazon Transcribe, semantic chunking, knowledge base design
- Cloud & Platform:
- AWS MSK, Redshift, Athena, S3, Glue (Data Catalog, Crawlers), Lambda, Step Functions, API Gateway, Cognito, CloudWatch, IAM, Lake Formation, Bedrock, DynamoDB, CloudFormation, AWS CDK
- Programming:
- Python (advanced), SQL (window functions, query optimization), TypeScript, JavaScript, Bash
- Engineering Practices:
- CI/CD (GitHub Actions, AWS Amplify), Docker, Agile/Scrum, system design, code review, technical mentorship
Experience
- Designed and maintained production-grade AWS data platform (MSK, RDS, Lambda, Redshift) supporting internal and client-facing BI reporting in Salesforce.
- Built automated ETL pipelines in Python (Glue, Step Functions) ingesting data from Jira, Confluence, and CloudHealth APIs, reducing manual reporting effort by ~40%.
- Developed RESTful APIs using Python (AWS Chalice), API Gateway, and Cognito for secure internal and client-facing data access.
- Optimized Redshift query performance through distribution and sort key tuning, improving dashboard load times and reducing compute costs.
- Built an internal GDPR data deletion tool using Python and Lambda with automated scheduling via Step Functions/EventBridge and full audit logging for compliance reporting.
- Architected an event-driven AI document generation pipeline using Amazon Bedrock (Nova Lite V2), processing raw transcripts into structured content via LLM and rendering production-ready PDF assets.
- Built a serverless GenAI pipeline integrating Bedrock Titan Image Generator for artwork enhancement and Amazon Transcribe for speech evaluation, with real-time job tracking.
- Collaborated with analysts and architects on data maturity assessments, improving modeling standards and reporting reliability.
- Designed and built backend APIs and data flows using TypeScript and RESTful architecture, serving as the primary data layer for production web applications.
- Improved application data throughput and response times by ~30% through caching strategies, query optimization, and modular service design.
- Led backend architecture and development of an internal recruitment platform with live video upload pipeline, replacing a costly third-party integration.
- Mentored junior developers, reducing onboarding time from 7–8 weeks to 4 weeks and improving code review standards across the team.
- Developed backend services for a fintech application using Python, FastAPI, and PostgreSQL, supporting KYC, OTP, and licensing workflows - app reached 1,000+ downloads within months of launch.
- Integrated Microsoft Power BI data into a subscription analytics platform via FastAPI, enabling self-serve reporting for non-technical stakeholders.
- Implemented secure authentication and validation workflows ensuring regulatory compliance across financial data processing flows.
Education
MSc Data Science (Distinction)
Distributed Systems, Machine Learning, Data Mining, Statistical Modelling. Dissertation commissioned by Rebura Ltd.
BSc Computer Science & Engineering (First Class)
Database Management, Competitive Programming, Pattern Recognition