Data Engineer | Python • SQL • AWS • Apache Spark
Akshit Verma
Portfolio
I am a Data Engineer with expertise in building and maintaining scalable data pipelines and data warehousing solutions. Proficient in SQL, Python, and Apache Spark for data processing and transformation. Experienced in designing ETL workflows, optimizing data queries, and working with cloud platforms for data analytics.
I have a strong foundation in database management, data modeling, and performance optimization ensuring efficient data flow and reliability across systems. Skilled in leveraging AWS services including Bedrock, API Gateway, and cloud infrastructure for building robust, production-grade data solutions.
Programming Languages: Python, SQL, HTML
Cloud & DevOps: AWS, Docker, Linux
Frameworks: FastAPI, Streamlit
Databases & Vector Stores: Pinecone
Version Control: Git, GitHub
Deployment & Hosting: Vercel, Netlify
[ 2020 — 2024 ]
Built a strong foundation in Object-Oriented Programming, Data Structures, Algorithms, and Database Management Systems. Focused on software engineering principles and large-scale data processing.
Tech Stack: Python, Pre-trained Models, Streamlit, AWS Bedrock
Developed a predictive analytics application using pre-trained machine learning models to forecast future trends and patterns with high accuracy. Built an interactive frontend using the Streamlit framework, enabling users to visualize predictions in real-time. Integrated AWS Bedrock for advanced chatbot functionality to provide insights and answer user queries about predictions.
Tech Stack: Python, Custom Deep Learning Model, Streamlit, AWS
Created a medical imaging application that detects diseases from uploaded X-ray images with confidence percentages. Built and trained a custom deep learning model using open-source datasets for accurate disease classification. Deployed the trained model on AWS and integrated it with a Streamlit frontend for a user-friendly interface.
Tech Stack: Python, AWS Bedrock, Streamlit, AWS API Gateway
Developed an application that extracts text from images and converts unstructured data into structured format for easy processing. Enabled users to download extracted data in multiple formats including Excel, JSON, and CSV. Created a REST API using AWS API Gateway for seamless integration, allowing developers to consume the extraction service programmatically.
Tech Stack: RAG, Vector Embeddings, Pinecone, Foundational Models
Built an intelligent chatbot using Retrieval Augmented Generation (RAG) that answers user medical queries with accuracy. Converted medical books and documents into vector embeddings for efficient semantic search and retrieval. Leveraged Pinecone vector database for storing and retrieving embeddings, ensuring fast and relevant responses.
Tech Stack: Streamlit, FastAPI, AWS Bedrock, Python
Developed an application that automatically generates comprehensive unit test cases when developers upload or paste their code. Built a backend API using FastAPI for efficient code processing and test generation logic. Integrated AWS Bedrock foundational models (LLM) to intelligently generate test cases and measure code coverage improvements.
Tech Stack: Python, Location Services, Real-time Data, Mobile-friendly
Created an application helping farmers maximize crop selling profits by connecting them with nearby mandis and real-time market rates. Implemented location-based services to identify the nearest agricultural markets and display live crop prices. Provided comprehensive marketplace information including transportation, storage, and other logistical support details.
Tech Stack: Python, Database Management, Notification System
Developed a comprehensive patient health management system for hospital nurses to efficiently track patient information and medical history. Implemented an automated reminder system to alert nurses about medication schedules and dosages at specified times. Built an inventory management module for tracking medical supplies and medication stock levels. Enabled data export functionality allowing nurses to generate and save patient records in structured formats for documentation.
Data Engineer [ Feb 2025 — Present ]
• Engineered a Python-based predictive analytics platform utilizing machine learning to forecast market trends with high accuracy, enabling data-driven decision-making.
• Developed an intelligent computer vision application for automated text extraction, converting unstructured visual data into organized formats and reducing manual processing time.
• Architected and optimized end-to-end data pipelines on AWS, ensuring 99.9% data reliability and seamless flow between distributed application components.
• Collaborated with cross-functional engineering teams to implement CI/CD best practices and maintain high-quality code standards for production deployments.
I am active on LinkedIn, where I share insights on Backend Engineering, Generative AI, and Scalable Data Systems. Feel free to connect for collaborations or opportunities.
Check out my latest projects and contributions to open-source software. My repositories focus on AI-driven applications, automation tools, and backend architectures.
Email: akshitverma2100@gmail.com
Location: Noida, Uttar Pradesh, India