AI
Technology

Outamation Advanced - AI-Powered Document Insights and Data Extraction Externship

What if AI could make sense of a 200-page mortgage blob faster than a human — and actually do something useful with it? In this externship, you’ll train models to read, extract, and search real mortgage docs using Python, OCR, and RAG. It’s document intelligence meets automation — and you're the architect.

10 weeks
Starts
September 22, 2025

What if you could train an AI to make sense of a 200-page mortgage document better than most professionals ever could?

Stop imagining — because that’s exactly what you’ll do in the Outamation AI-Powered Document Intelligence Externship.This isn’t your average “learn Python” type of project. You’re stepping into the mortgage industry’s biggest bottleneck — massive, messy, scanned document blobs — and building the system that solves it. From OCR to smart extraction to RAG-powered search, you’ll teach machines how to read, reason, and route information with precision.

In the time you complete this externship you'll have used the following technologies:

  • Python for scripting and data processing
  • PyMuPDF, pdfplumber for PDF parsing
  • OpenCV, PIL for image preprocessing
  • OCR Engines: Tesseract, PaddleOCR, EasyOCR
  • LlamaIndex, FAISS or Chroma for vector-based retrieval
  • Gemini API and open-source LLMs (Mistral, Phi-2)
  • Streamlit or Gradio for optional user interface
  • Google Colab for experimentation and collaboration

This isn't a sandbox. It’s real-world tough. You’ll build a production-ready pipeline, test it against actual document chaos, and maybe even deploy an interface that makes your AI usable by actual humans. If you’re ready to stop talking about AI and actually build something that matters — this is your moment!

Project Output

Here's what you'll be doing:

  • Explore How AI Reads Documents: Learn how computer vision and NLP models understand and classify documents in industries like mortgage processing.
  • Write Python for Document Extraction: Build scripts using PyPDF2, PyMuPDF, and pdfplumber to pull key data from unstructured PDFs.
  • Build AI-Powered Search Tools: Use tools like LlamaIndex and FAISS to create search systems over embedded document content.
  • Compare Open-Source AI Models: Test models like LLaMA and HuggingFace on real document tasks and share what works best.
  • Deliver a Final Analysis: Package your findings into a report that showcases your tech stack, your logic, and your results.

Key deliverables for this externship will include:

  1. Python pipeline to process a 200+ page mortgage blob
  2. Structured field-level JSON output with coordinates or labels
  3. Fully functional RAG pipeline using Gemini or open-source models
  4. Optional search interface (Streamlit or Gradio)
  5. Final documentation, reflection report, and system demo
Get Started Today

Turn complex documents into actionable insights with cutting-edge AI tools and real-world experience.

What past Externs have to say

Erik Schalk
Beats by Dre Extern
"Extern played a crucial role in bridging the gap between my formal business education & real-world application. The opportunity to present actionable recommendations to the Head of Customer Insights at Beats by Dre was invaluable, propelling my leadership journey and paving the way for my current role at Rolls-Royce.”
Now a Project Lead
at Rolls-Royce
Tori Nguyen
AT&T Extern
“I credit the Externship as the sole reason I was hired to intern at AT&T. I had never heard of the AT&T internship program and would never have applied to it if I hadn’t gone through the Externship. It allowed me to develop my skills, showcase my work and ultimately stand out so I was awarded the opportunity to interview the CEO of AT&T!”
Became an intern
at AT&T
Richard Wilson
Meta Extern
"The Externship is a great way to introduce students to what it is like to work on a project. I became a lot more data-focused and analytical in my approach. As an industrial engineering major, people questioned how the externship was related to my studies. My journey has shown me that you don’t have to be limited by your major."
Now a Technical
Program Manager at Meta
Maisa Mirza
HP Tech Ventures Externship
“During my venture capital externship, I learned how to use industry-standard metrics and tools like SQL and Tableau to determine startup success potential.This opportunity helped me stand out as a candidate and opened new doors for me - including offers from EY and Accenture."
Now an Analyst
at Accenture
Garrett Boyce
HP Tech Ventures Externship
“If I hadn’t done a VC externship during term time, I know I’d be spending hours watching YouTube videos just to understand which careers I might enjoy! I don’t think I fully understood the value of an externship till I actually did one and now I want to argue that externships should be a part of the college experience for every student.”
Now an Analyst at Boston Consulting Group
Diego Juarez
Crafted Capital Externship
“My Externship really changed my career trajectory. As students, we often chase after Fortune 500 work experiences, but the intimate experience of getting to work with a startup can actually really help you learn a lot and level up what you can bring to the table. My cohort even had the opportunity to speak with the founder of Crafted Capital and learn from his firsthand experience.”
Now a CLDP Analyst at
JP Morgan Chase & Co

Skills You'll Gain

Write Python scripts to extract and organize data from messy, real-world PDFs.
Use OCR and NLP libraries to identify and tag document content.
Build and test Retrieval-Augmented Generation (RAG) pipelines using LlamaIndex.
Evaluate and document the performance of open-source AI models.
Visualize and explain complex tech workflows in simple, clear presentations.

Your Schedule

10 weeks
Project 1
Introduction to Mortgage Documents and AI (Week 1)
Project 2
Python and Image Preprocessing (Week 2)
Project 3
Text Extraction from PDFs + Field Heuristics (Week 3)
Project 4
Advanced OCR Comparison and Layout-Aware Extraction (Week 4)
Project 5
Introduction to Retrieval-Augmented Generation (RAG) (Weeks 5 & 6)
Project 6
Advanced RAG and Open-Source Experiments (Week 7)
Project 7
Blob Processing, Classification, and Routing (Week 8)
Project 8
Multi-Document Search + Optional UI (Week 9)
Project 9
Final Integration, Testing, and Evaluation (Week 10)

About Outamation

Outamation is a tech startup founded by fintech professionals with deep domain knowledge in workflow automation and rapid application development. The Outamation team has worked in North America, Europe, and APAC, with more than 150 years of collaborative experience in technology industries. Using their proprietary Drip innovation by Outamation™ approach, they innovate and deliver solutions faster. The depth of technology and subject matter expertise has made Outamation a trusted partner for clients and partners in the real estate and healthcare industries. 

The Home Depot is not accepting or considering any applications for this Externship through other channels.
Meet Your Host

Lisa Guadagno

Sr. Principal at Energy Innovation Capital

Results-oriented executive with a proven track record in driving innovation, implementing global programs, building high performing teams and optimizing operational centers to deliver exceptional results. Earned reputation as a strong, resourceful and dynamic visionary, with polished communication skills and natural authority who builds collaborative relationships with internal and external partners to gain support and buy-in, formulating product roadmaps and leveraging technology to deliver high impact change. Proven success implementing innovation programs that achieve sustainable growth, simplify processes, enhance risk effectiveness, and ensure employee and customer satisfaction, with minimal disruption to core activities. Currently serving as a senior industry advisor and board member, providing strategic insights and guidance to organizations.

Get Started Today

Turn complex documents into actionable insights with cutting-edge AI tools and real-world experience.