Pfizer Supply Chain Document Processing Externship
Build AI-powered document intelligence with Pfizer, automating healthcare and clinical supply documents using OCR, LLMs, and RAG
In 2023, the U.S. faced its worst drug shortage in a decade, with over 300 essential medications reported in shortage by federal trackers. Patients waited. Hospitals scrambled. Lives hung in the balance.
â
Behind every delay? A mountain of paperwork. Shipping labels. Purchase orders. Logistics forms. Documents that humans still read line by line. Now imagine AI that processes thousands of pharmaceutical documents in secondsâflagging compliance risks, extracting critical fields, and preventing shipment delays before they cascade.
â
In this externship, you'll build exactly that. You'll write Python scripts that pull text from messy PDFs, build and evaluate OCR pipelines that can read labels no human wants to touch, and create AI-powered search tools that answer questions across thousands of pages instantly. By the end, you'll deliver a working document intelligence system for real-world clinical supply workflows at Pfizerâand walk away with a portfolio project that proves you can build AI that actually matters.
Project Output
â
During this Externship, externs will:
- Ingest and parse scanned documents
- Extract and structure key data fields from multiple document types
- Implement intelligent search using RAG pipelines and metadata filters
- Build and test modular routing and classification logic
- (Optional) Deploy a lightweight user interface for search and querying
â
Technologies used:
- Python (scripting, data processing)
- PyMuPDF, pdfplumber (PDF parsing)
- OpenCV, PIL (image preprocessing)
- OCR Engines: Tesseract, PaddleOCR, EasyOCR
- Vector Search: LlamaIndex, FAISS, Chroma
- LLMs: Gemini API or open-source models (e.g., Mistral, Phi-2)
- UI Tools: Streamlit, Gradio (optional)
â
Final project outputs:
- Python pipeline to process document files
- Structured JSON output with field-level data and coordinates
- Functional RAG pipeline (Gemini or open-source)
- Searchable UI using Streamlit or Gradio
- Final report, system documentation, and demo
â
Why should you care? Here's what you'll get out of this Externship:
- Develop a portfolio-worthy AI/ML project
- Gain hands-on experience with cutting-edge tools and models
- Learn to build robust, production-level systems
- Improve data parsing, extraction, and visualization skills
- Tackle a real-world, high-impact business use case
Get Started with Onboarding Today Before Live Sessions Begin
Sign-up today for exposure to healthcare and supply chain while learning AI Automation!
This Externship has passed
What past Externs have to say
Skills You'll Gain
Your Schedule
Pfizer is one of the world's largest publicly traded pharmaceutical companies, developing and manufacturing medicines and vaccines that reach hundreds of millions of people globally. They partnered with BioNTech to create the first FDA-authorized COVID-19 vaccine. In 2024, Pfizer reported $63 billion in revenue and products reaching 414 million patients worldwideâmassive scale. Their clinical supply operations ensure that the right drugs get to the right patients at the right timeâa process that depends on flawless documentation across shipping, logistics, and regulatory compliance.

Russell Vassallo
Russell Vassallo is a Senior Manager of Material Planning & Logistics at Pfizer, where he brings over a decade of experience supporting the companyâs clinical supply pipeline. Based in Groton, Connecticut, Russell oversees materials management across Pfizerâs clinical portfolioâfrom early-phase API production through finished, packaged drug product shipped directly to patient studies. His work spans internal and external manufacturing partners, ensuring critical materials are available where and when theyâre needed across the global clinical supply chain.
â
Russell began his career at Pfizer as an Environmental Health & Safety intern and has progressed through roles in inventory management and materials coordination, building deep operational expertise along the way. He holds a bachelorâs degree in Environmental Science with a concentration in Chemistry from the University of Connecticut and brings a strong interest in process improvement, automation, and sustainability. In addition to his professional work, Russell enjoys his time outside of work traveling, hiking, spending time on the water, and being active playing pickleball and hockey.
Get Started Today
Sign-up today for exposure to healthcare and supply chain while learning AI Automation!










