An efficient retrieval mechanism to chat specifically with a technical document using the OpenAI API. Embeddings of relevant snippets are created using the state-of-the-art (at that point) INSTRUCTOR model, followed by a dimensionality reduction using Uniform Manifold Approximation and Projection (UMAP). Finally, the embeddings are clustering into buckets of similar content using HDBSCAN. This system aims to solve hard problems like answering indirect query questions and looking up relevant references.
-
Notifications
You must be signed in to change notification settings - Fork 0
jashvira/ChatPDF
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
Chat with any research paper, using OpenAI API and a novel retrieval algorithm.
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published