# [Local Embeddings] Community Support thread #370
I don't think so at the moment.
You can change the embeddings API base URL to a local one; it must still be compatible with the OpenAI API schema.
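For illustration, a minimal `settings.yaml` embeddings fragment for such an override might look like the following sketch; the model name and endpoint are assumptions, not from this thread, and the local server must expose an OpenAI-compatible `/embeddings` route:

```yaml
embeddings:
  llm:
    api_key: ${GRAPHRAG_API_KEY}
    type: openai_embedding
    model: nomic-embed-text  # placeholder; any locally served embedding model
    api_base: http://localhost:11434/v1  # e.g. Ollama's OpenAI-compatible endpoint
```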
It works with Ollama embeddings by changing the file `/opt/anaconda3/envs/graphrag/lib/python3.11/site-packages/graphrag/llm/openai/openai_embeddings_llm.py`:

```python
from typing_extensions import Unpack

from graphrag.llm.base import BaseLLM
# Added import: these type names are used below but were missing
# from the flattened snippet in the original comment.
from graphrag.llm.types import EmbeddingInput, EmbeddingOutput, LLMInput

from .openai_configuration import OpenAIConfiguration

import ollama


class OpenAIEmbeddingsLLM(BaseLLM[EmbeddingInput, EmbeddingOutput]):
    # (The class body was cut off in the original comment.)
    ...
```
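The class body is truncated above. A hedged sketch of how this override typically continues (the method belongs inside the class; the `nomic-embed-text` model name is a placeholder, not from the comment):

```python
    async def _execute_llm(
        self, input: EmbeddingInput, **kwargs: Unpack[LLMInput]
    ) -> EmbeddingOutput | None:
        # Call a local Ollama server once per input instead of the OpenAI API.
        embedding_list = []
        for inp in input:
            embedding = ollama.embeddings(model="nomic-embed-text", prompt=inp)
            embedding_list.append(embedding["embedding"])
        return embedding_list
```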
Thank you for sharing. It's a pretty brutal fix, but if it works, it's at least a stopgap until the Microsoft team implements a more elegant solution. Maybe checking for the presence of an `ollama = true` flag in the embedding parameters could keep the default behaviour and only use the hack when true.
My solution was to write a local server with Flask that basically serves as a decoder of "cl100k_base" and a caller of Ollama, then change the `api_base` for embeddings to the localhost address. It works pretty well as far as I am concerned.
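A minimal sketch of such a Flask shim, assuming GraphRAG posts pre-tokenized (cl100k_base) input to the OpenAI embeddings route; the port, route, and model name are illustrative, not from the comment:

```python
import ollama
import tiktoken
from flask import Flask, jsonify, request

app = Flask(__name__)
encoder = tiktoken.get_encoding("cl100k_base")


@app.post("/v1/embeddings")
def embeddings():
    body = request.get_json()
    inputs = body["input"]
    if isinstance(inputs, str):
        inputs = [inputs]
    # The OpenAI client may send lists of token IDs rather than strings;
    # decode them back to text before handing them to Ollama.
    if inputs and isinstance(inputs[0], list):
        inputs = [encoder.decode(chunk) for chunk in inputs]
    data = [
        {
            "object": "embedding",
            "index": i,
            "embedding": ollama.embeddings(
                model=body.get("model", "nomic-embed-text"), prompt=text
            )["embedding"],
        }
        for i, text in enumerate(inputs)
    ]
    return jsonify({"object": "list", "data": data, "model": body.get("model")})


if __name__ == "__main__":
    # Point the embeddings api_base at http://localhost:5000/v1 to use this shim.
    app.run(port=5000)
```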
@zeyunie-vecml, yes, that's precisely what the provided server does, in addition to the OAI <-> Ollama translation. UPD: messed up GitHub threads; this was in relation to this server.
I'm making this thread our official discussion place for Local Embeddings setup and troubleshooting.
I think adding a middleware API for the OAI embeddings works. The embeddings settings point `api_base` at the middleware:

```yaml
## parallelization: override the global parallelization settings for embeddings
async_mode: threaded # or asyncio
llm:
  api_key: ${GRAPHRAG_API_KEY}
  type: openai_embedding # or azure_openai_embedding
  model: mxbai-embed-large
  api_base: http://localhost:8686/api
```

And the middleware itself:

```python
import fastapi
from langchain_community.embeddings.ollama import OllamaEmbeddings

app = fastapi.FastAPI()


@app.post('/api/embeddings')
def embeddings(body: dict):
    # Embed each input document with the Ollama model named in the request.
    ollama = OllamaEmbeddings(model=body['model'])
    res = ollama.embed_documents(body['input'])
    # Return only the fields GraphRAG's OpenAI client actually reads.
    return {
        'data': [
            {'embedding': rs} for rs in res
        ]
    }
```
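If the middleware above is saved as `server.py` (filename assumed), running `uvicorn server:app --port 8686` makes the `api_base` above resolve. Note that the response deliberately returns only `data[*].embedding`, which appears to be the only part of the OpenAI embeddings response that GraphRAG consumes.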
I used the Ollama embedding with the above modification to the embedding function and was able to generate the graph, but I can't query the graph with a similar modification to another embed function; the traceback cuts off at `for attempt in retryer:`. Kind of wondering what's going wrong here :)
#451. |
Instead of Ollama I am trying llama.cpp for embeddings, but I get this error (the log cuts off after this line):

```
11:17:36,84 graphrag.llm.openai.create_openai_client INFO Creating OpenAI client base_url=http://localhost:8080
```
I'm having the same problem. Has this been solved, or how can I avoid it?
@wanglufei1 check out #345. Embedding with Ollama works with the modification made by user Spacelearner.
@silviachen46 The local search with embeddings from Ollama now works.
Consolidating Ollama-related issues: #657 |