Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add configuration options for changing embedding model name and NIM endpoint url at runtime #440

Open
wants to merge 11 commits into
base: main
Choose a base branch
from

Conversation

ChrisJar
Copy link
Collaborator

@ChrisJar ChrisJar commented Feb 13, 2025

Description

This PR adds model name, and NIM endpoint url configuration options to the embed task so that users can configure different embedding NIMs/models at runtime. If these configuration options aren't set it falls back to the current behavior of pulling the model name from an env variable when starting the container.

Checklist

  • I am familiar with the Contributing Guidelines.
  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

Copy link

copy-pr-bot bot commented Feb 13, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@ChrisJar ChrisJar marked this pull request as ready for review February 14, 2025 01:07
@ChrisJar ChrisJar marked this pull request as draft February 14, 2025 18:11
@ChrisJar ChrisJar marked this pull request as ready for review February 15, 2025 00:33
@ChrisJar ChrisJar requested a review from a team as a code owner February 15, 2025 00:33
@ChrisJar ChrisJar requested review from jperez999, jdye64, edknv and drobison00 and removed request for a team, jperez999, jdye64, edknv and drobison00 February 15, 2025 00:33
@drobison00
Copy link
Collaborator

Looks good; sorry, the recent unification updates introduced some differences that break some of these issues. Happy to sync on it and help merge.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants