
Fix: Make OpenAIEmbedding work when token usage info is not set #369

Merged 2 commits into main on Nov 27, 2024

Conversation

@aniketmaurya (Collaborator) commented Nov 27, 2024

Before submitting
  • Was this discussed/agreed via a GitHub issue? (no need for typos and docs improvements)
  • Did you read the contributor guideline, Pull Request section?
  • Did you make sure to update the docs?
  • Did you write any new necessary tests?

What does this PR do?

OpenAIEmbeddingSpec requires token usage to be set in the context; otherwise it fails with the example below. This PR makes the spec work without it by defaulting the usage counts to 0 when they are not provided.

Error

  File "/Users/aniket/Projects/github/LitServe/src/litserve/api.py", line 90, in encode_response
    return self._spec.encode_response(output, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/aniket/Projects/github/LitServe/src/litserve/specs/openai_embedding.py", line 128, in encode_response
    "prompt_tokens": context_kwargs.get("prompt_tokens", 0),
                     ^^^^^^^^^^^^^^^^^^
AttributeError: 'NoneType' object has no attribute 'get'
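The traceback shows that `context_kwargs` is `None` when no usage info was set, so calling `.get` on it raises `AttributeError`. A minimal sketch of the kind of guard this PR describes (illustrative names only, not the exact LitServe diff):

```python
# Hypothetical helper sketching the fix: fall back to an empty dict when
# no token-usage context was provided, so .get() defaults apply.
def build_usage(context_kwargs):
    """Build an OpenAI-style usage dict, tolerating a None context."""
    context_kwargs = context_kwargs or {}  # the missing guard: None -> {}
    return {
        "prompt_tokens": context_kwargs.get("prompt_tokens", 0),
        "total_tokens": context_kwargs.get("total_tokens", 0),
    }

print(build_usage(None))  # usage defaults to zeros instead of crashing
print(build_usage({"prompt_tokens": 8, "total_tokens": 8}))
```

With the guard in place, an API that never sets token usage (like the repro below) still produces a valid response with zeroed usage counts.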

Code

import litserve as ls
from sentence_transformers import SentenceTransformer

class EmbeddingsAPI(ls.LitAPI):
    def setup(self, device):
        self.model = SentenceTransformer('all-MiniLM-L6-v2', device=device)

    def predict(self, inputs):
        embeddings = self.model.encode(inputs)
        return embeddings

if __name__ == "__main__":
    api = EmbeddingsAPI()
    server = ls.LitServer(api, spec=ls.OpenAIEmbeddingSpec())
    server.run(port=8000)

PR review

Anyone in the community is free to review the PR once the tests have passed.
If we didn't discuss your PR in GitHub issues there's a high chance it will not be merged.

Did you have fun?

Make sure you had fun coding 🙃

codecov bot commented Nov 27, 2024

Codecov Report

Attention: Patch coverage is 50.00000% with 2 lines in your changes missing coverage. Please review.

Project coverage is 94%. Comparing base (771a1c9) to head (d1f56ed).
Report is 1 commit behind head on main.

Additional details and impacted files
@@         Coverage Diff         @@
##           main   #369   +/-   ##
===================================
- Coverage    94%    94%   -0%     
===================================
  Files        25     25           
  Lines      1563   1564    +1     
===================================
  Hits       1465   1465           
- Misses       98     99    +1     

@aniketmaurya aniketmaurya enabled auto-merge (squash) November 27, 2024 00:59
@aniketmaurya aniketmaurya merged commit ab6828a into main Nov 27, 2024
20 of 21 checks passed
@aniketmaurya aniketmaurya deleted the aniket/fix-encode-response branch November 27, 2024 11:38