Commit bb8a7037 authored by Clark Lin's avatar Clark Lin
Browse files

added README for multiple folders; added embed config file sample

parent af755201
.venv .venv
.idea .idea
.DS_Store .DS_Store
/data /data/RAG/chromadb/*
!/data/RAG/chromadb/README.md
/data/RAG/docs/*
!/data/RAG/docs/README.md
/setup/config_embed.json /setup/config_embed.json
\ No newline at end of file
...@@ -17,5 +17,6 @@ pip install jupyter langchain_community chromadb pypdf requests ...@@ -17,5 +17,6 @@ pip install jupyter langchain_community chromadb pypdf requests
## 设定配置文件 ## 设定配置文件
/setup/config_embed_sample.json /setup/config_embed_sample.json
## 重命名 /setup/config_embed_sample.json -> /setup/config_embed.json ## 重命名
/setup/config_embed_sample.json -> /setup/config_embed.json
## 编辑每个book的准备部分,执行 ## 编辑每个book的准备部分,执行
# Leave blank
Leave this folder blank. After embedding, chroma data is automatically created here
\ No newline at end of file
# Put documents
Put documents in this folder before embedding and retrieval
\ No newline at end of file
{
"API_URL": "https://api.cohere.com/v1/embed",
"cohere_access_token": "...",
"custom_proxies": {
"http": "http://...:8080",
"https": "http://...:8080"
},
"list_chroma_dir": ["..", "data", "RAG", "chromadb"],
"list_embed_file_path": ["..", "data", "RAG", "docs"],
"docs": {
"oracle-scm-planning": {
"model_name": "embed-multilingual-v3.0",
"file_name": "....pdf",
"split_chunk_size": 1000,
"split_overlap": 150,
"model_batch_size": 50,
"collection_ids_prefix": "ids_",
"collection_name": "cohere_..."
}
}
}
\ No newline at end of file
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment