Model Description: RAG (Retrieval-Augmented Generation)
A RAG (Retrieval-Augmented Generation) model retrieves relevant information from large volumes of unstructured data and uses it to generate answers to questions.
1. File upload (PDF, TXT)
2. Text extraction & chunking
3. 🧠 Vectorization (embedding)
4. 💾 Vector storage (FAISS)
5. ❓ Question input → retrieval + LLM response
6. 📱 Connect everything through a Streamlit UI
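Steps 2 through 5 can be sketched with the standard library alone. This is a minimal illustration, not the project's implementation: the function names (`split_into_chunks`, `embed`, `retrieve`) and the default sizes are hypothetical, the bag-of-words `embed` is a toy stand-in for a real embedding model, and the linear scan in `retrieve` stands in for a FAISS index lookup.

```python
import math
import re
from collections import Counter


def split_into_chunks(text: str, chunk_size: int = 200, overlap: int = 50) -> list[str]:
    """Split text into character windows of chunk_size that overlap by `overlap`."""
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be non-negative and smaller than chunk_size")
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break  # the last window already reaches the end of the text
    return chunks


def embed(text: str) -> Counter:
    """Toy embedding (assumption): word counts instead of a learned dense vector."""
    return Counter(re.findall(r"\w+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[token] * b[token] for token in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(question: str, chunks: list[str], top_k: int = 2) -> list[str]:
    """Return the top_k chunks most similar to the question (brute-force scan)."""
    query = embed(question)
    ranked = sorted(chunks, key=lambda chunk: cosine(query, embed(chunk)), reverse=True)
    return ranked[:top_k]
```

In a real pipeline the retrieved chunks would then be passed to the LLM together with the question. The overlap between neighboring chunks is a common way to avoid cutting a sentence's context in half at a chunk boundary.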
Overall Structure Design
rag_file_chat/
├── frontend/
│   └── app.py            ← 📱 Streamlit UI: user input and response output
├── rag/
│   ├── loader.py         ← Read PDF/TXT files + split into chunks
│   ├── vector_store.py   ← 🧠 Text embedding + vector save/load
│   └── qa_engine.py      ← ❓ RAG-based question handling + 🧾 user question/answer history management (Chat Memory)
├── data/
│   ├── uploads/          ← Storage for uploaded source documents
│   └── vectors/          ← 💾 Vector DB storage
├── .env                  ← API keys and other sensitive information
└── requirements.txt      ← 📦 Package dependency list
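The chat memory role assigned to qa_engine.py in the layout above could look roughly like the following. This is a sketch under assumptions: the `ChatMemory` class, the `build_prompt` function, and the prompt wording are hypothetical and are not taken from the project. It keeps only the most recent turns and places them ahead of the retrieved context when assembling the prompt for the LLM.

```python
from collections import deque


class ChatMemory:
    """Keeps the most recent question/answer pairs (hypothetical helper)."""

    def __init__(self, max_turns: int = 5):
        # A bounded deque drops the oldest turn automatically once it is full.
        self._turns = deque(maxlen=max_turns)

    def add(self, question: str, answer: str) -> None:
        self._turns.append((question, answer))

    def as_text(self) -> str:
        return "\n".join(f"User: {q}\nAssistant: {a}" for q, a in self._turns)


def build_prompt(question: str, contexts: list[str], memory: ChatMemory) -> str:
    """Combine chat history, retrieved chunks, and the new question into one prompt."""
    parts = []
    history = memory.as_text()
    if history:
        parts.append("Conversation so far:\n" + history)
    parts.append("Context:\n" + "\n---\n".join(contexts))
    parts.append("Question: " + question)
    return "\n\n".join(parts)
```

Capping the history with `max_turns` keeps the prompt from growing without bound over a long session, at the cost of forgetting older turns.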
$env:PYTHONPATH="."; streamlit run frontend/app.py  # Windows PowerShell
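The command above is specific to PowerShell. On other shells, the same effect (project root on PYTHONPATH, then starting the Streamlit app) might be achieved with a small Python launcher like the one below. The helper names `build_launch` and `launch` are hypothetical, and the sketch assumes Streamlit is installed in the current interpreter's environment so that `python -m streamlit run` works.

```python
import os
import subprocess
import sys


def build_launch(project_root: str = ".", app_path: str = "frontend/app.py"):
    """Return the command and environment used to start the app."""
    env = dict(os.environ)
    existing = env.get("PYTHONPATH")
    # Put the project root first so `import rag...` resolves from the repo.
    env["PYTHONPATH"] = project_root if not existing else project_root + os.pathsep + existing
    cmd = [sys.executable, "-m", "streamlit", "run", app_path]
    return cmd, env


def launch() -> int:
    """Start Streamlit and return its exit code (blocks until the server stops)."""
    cmd, env = build_launch()
    return subprocess.call(cmd, env=env)
```

Calling `launch()` from the repository root starts the UI. `os.pathsep` is used so the PYTHONPATH separator is correct on both Windows and POSIX systems.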