Pfizer Supply Chain Document Processing Externship
Build AI-powered document intelligence with Pfizer, automating healthcare and clinical supply documents using OCR, LLMs, and RAG
In 2023, the U.S. faced its worst drug shortage in a decade, with over 300 essential medications reported in shortage by federal trackers. Patients waited. Hospitals scrambled. Lives hung in the balance.
Behind every delay? A mountain of paperwork. Shipping labels. Purchase orders. Logistics forms. Documents that humans still read line by line. Now imagine AI that processes thousands of pharmaceutical documents in seconds—flagging compliance risks, extracting critical fields, and preventing shipment delays before they cascade.
In this externship, you'll build exactly that. You'll write Python scripts that pull text from messy PDFs, build and evaluate OCR pipelines that can read labels no human wants to touch, and create AI-powered search tools that answer questions across thousands of pages instantly. By the end, you'll deliver a working document intelligence system for real-world clinical supply workflows at Pfizer—and walk away with a portfolio project that proves you can build AI that actually matters.
Project Output
During this Externship, externs will:
- Ingest and parse scanned documents
- Extract and structure key data fields from multiple document types
- Implement intelligent search using RAG pipelines and metadata filters
- Build and test modular routing and classification logic
- (Optional) Deploy a lightweight user interface for search and querying
Technologies used:
- Python (scripting, data processing)
- PyMuPDF, pdfplumber (PDF parsing)
- OpenCV, PIL (image preprocessing)
- OCR Engines: Tesseract, PaddleOCR, EasyOCR
- Vector Search: LlamaIndex, FAISS, Chroma
- LLMs: Gemini API or open-source models (e.g., Mistral, Phi-2)
- UI Tools: Streamlit, Gradio (optional)
Final project outputs:
- Python pipeline to process document files
- Structured JSON output with field-level data and coordinates
- Functional RAG pipeline (Gemini or open-source)
- Searchable UI using Streamlit or Gradio
- Final report, system documentation, and demo
Why should you care? Here's what you'll get out of this Externship:
- Develop a portfolio-worthy AI/ML project
- Gain hands-on experience with cutting-edge tools and models
- Learn to build robust, production-level systems
- Improve data parsing, extraction, and visualization skills
- Tackle a real-world, high-impact business use case
Get Started with Onboarding Today Before Live Sessions Begin
Sign-up today for exposure to healthcare and supply chain while learning AI Automation!
This Externship has passed
What past Externs have to say
Skills You'll Gain
Your Schedule

It's Pfizer's mission to be the premier, innovative biopharmaceutical company, making breakthroughs that change patients’ lives. Good health is vital to all of us and finding sustainable solutions to the most pressing health care challenges of our world cannot wait. That's why we at Pfizer are committed to applying science and our global resources to improve health and well-being at every stage of life. We strive to provide access to safe, effective and affordable medicines and related health care services to the people who need them.
We have a leading portfolio of products and medicines that support wellness and prevention, as well as treatment and cures for diseases across a broad range of therapeutic areas; and we have an industry-leading pipeline of promising new products that have the potential to challenge some of the most feared diseases of our time, like Alzheimer's disease and cancer. Join us and we can work together for a healthier world.

Russell Vassallo
Russell Vassallo is a Senior Manager of Material Planning & Logistics at Pfizer, where he brings over a decade of experience supporting the company’s clinical supply pipeline. Based in Groton, Connecticut, Russell oversees materials management across Pfizer’s clinical portfolio—from early-phase API production through finished, packaged drug product shipped directly to patient studies. His work spans internal and external manufacturing partners, ensuring critical materials are available where and when they’re needed across the global clinical supply chain.
Russell began his career at Pfizer as an Environmental Health & Safety intern and has progressed through roles in inventory management and materials coordination, building deep operational expertise along the way. He holds a bachelor’s degree in Environmental Science with a concentration in Chemistry from the University of Connecticut and brings a strong interest in process improvement, automation, and sustainability. In addition to his professional work, Russell enjoys his time outside of work traveling, hiking, spending time on the water, and being active playing pickleball and hockey.
Get Started Today
Sign-up today for exposure to healthcare and supply chain while learning AI Automation!










