Tags: nubison/nubison-model
feat: add context support for parallel inference (#12)

* feat: enhance model loading with context support

- Updated the `load_model` method in `UserModel` and `NubisonModel` to accept a `ModelContext` argument, allowing worker-specific information to be used during model loading.
- Introduced the `ModelContext` type definition to encapsulate the worker index and total number of workers, supporting GPU initialization in parallel setups.
- Adjusted related code in the service and tests to accommodate the new context parameter.
- Updated `README.md` to document the changes to the `load_model` method and its parameters.

* feat: update Dockerfile to support parallel inference

- Added Open Container Initiative (OCI) labels for better image description and source tracking.
- Introduced a `NUM_WORKERS` environment variable with a default value of 4 to configure the number of workers for the application.

* refactor: reduce default worker count for improved resource management

- Updated the Dockerfile to change the `NUM_WORKERS` environment variable from 4 to 2, reducing resource consumption.
- Adjusted the default number of workers in `Service.py` from 4 to 1 to align with the new Docker configuration.
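The `load_model(context)` contract described above can be sketched as follows. This is a minimal illustration, not the library's actual code: the field names `worker_index` and `num_workers` are assumptions, since the commit only states that `ModelContext` carries the worker index and total number of workers.

```python
from dataclasses import dataclass


# Hypothetical shape of ModelContext; real field names in
# nubison-model may differ from these assumed ones.
@dataclass
class ModelContext:
    worker_index: int  # 0-based index of this worker
    num_workers: int   # total number of workers in the pool


class UserModel:
    def load_model(self, context: ModelContext) -> None:
        # In a parallel setup, each worker can derive its GPU
        # assignment from its index (assuming one GPU per worker).
        self.device = f"cuda:{context.worker_index % context.num_workers}"


# Example: the second of two workers pins itself to cuda:1.
model = UserModel()
model.load_model(ModelContext(worker_index=1, num_workers=2))
print(model.device)  # → cuda:1
```

The context parameter is what makes per-worker GPU initialization possible: without it, every worker running `load_model` would be indistinguishable and would contend for the same device.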