Feat/optimize model gateway #398

mnvsk97 · 2024-10-30T08:19:12Z

Avoid creating a model instance for every request and instead cache by model name, config, and other metadata.
Add simple local dict based cache for embedding, llm, reranker, and audio models.
Always check in cache before creating an instance of a model to support 1.
Add cachetools library to support simple caching mechanisms and also for complex cases in the future.
Add documentation for each method in the file

* feat: model gateway optimization --------- Co-authored-by: Sai krishna <[email protected]>

Sai krishna added 2 commits October 30, 2024 12:01

feat: model gateway optimization

dbde9c9

feat: optimize model registration

4c41cdc

mnvsk97 enabled auto-merge (squash) October 30, 2024 08:46

Blakeinstein approved these changes Oct 30, 2024

View reviewed changes

Merge branch 'main' into feat/optimize-model-gateway

94ff95c

mnvsk97 merged commit dc520ae into truefoundry:main Oct 31, 2024
1 check passed

S1LV3RJ1NX pushed a commit that referenced this pull request Nov 29, 2024

Feat/optimize model gateway (#398)

16bbf0c

* feat: model gateway optimization --------- Co-authored-by: Sai krishna <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feat/optimize model gateway #398

Feat/optimize model gateway #398

mnvsk97 commented Oct 30, 2024 •

edited

Loading

Feat/optimize model gateway #398

Feat/optimize model gateway #398

Conversation

mnvsk97 commented Oct 30, 2024 • edited Loading

mnvsk97 commented Oct 30, 2024 •

edited

Loading