Free views, likes and subscribers at YouTube. Now!
Get Free YouTube Subscribers, Views and Likes

Multi-modal RAG: Chat with Docs containing Images

Follow
Prompt Engineering

Learn how to build a multimodal RAG system using CLIP mdoel.

LINKS:
Notebook: https://tinyurl.com/pfc64874
Flow charts in the paper:
https://tinyurl.com/4pp78xuf
https://tinyurl.com/5yeww5py
https://tinyurl.com/4un6y6x5
https://tinyurl.com/2jkbb3ma


RAG Beyond Basics Course:
https://promptssite.thinkific.com/c...

Let's Connect:
Discord:   / discord  
☕ Buy me a Coffee: https://kofi.com/promptengineering
| Patreon:   / promptengineering  
Consulting: https://calendly.com/engineerprompt/c...
Business Contact: [email protected]
Become Member: http://tinyurl.com/y5h28s6h

Preconfigured localGPT VM: https://bit.ly/localGPT (use Code: PromptEngineering for 50% off).

Signup for Newsletter, localgpt:
https://tally.so/r/3y9bb0


00:00 Introduction to Multimodal RAC Systems
01:24 First Approach: Unified Vector Space
02:23 Second Approach: Grounding Modalities to Text
03:57 Third Approach: Separate Vector Stores
06:26 Code Implementation: Setting Up
09:05 Code Implementation: Downloading Data
11:13 Code Implementation: Creating Vector Stores
14:00 Querying the Vector Store


All Interesting Videos:
Everything LangChain:    • LangChain  

Everything LLM:    • Large Language Models  

Everything Midjourney:    • MidJourney Tutorials  

AI Image Generation:    • AI Image Generation Tutorials  

posted by durhoossymx