xinference.client.Client.describe_model
- Client.describe_model(model_uid: str)
Get model information via RESTful APIs.
- Parameters:
model_uid (str) – The unique ID that identifies the model.
- Returns:
A dictionary containing the following keys:
- "model_type": str
the type of the model determined by its function, e.g. "LLM" (Large Language Model)
- "model_name": str
the name of the specific LLM family
- "model_lang": List[str]
the languages supported by the LLM
- "model_ability": List[str]
the capabilities of the LLM
- "model_description": str
a detailed description of the LLM
- "model_format": str
the format specification of the LLM
- "model_size_in_billions": int
the size of the LLM in billions of parameters
- "quantization": str
the quantization applied to the model
- "revision": str
the revision number of the LLM specification
- "context_length": int
the maximum text length the LLM can accommodate (including all input and output)
- Return type:
dict
- Raises:
RuntimeError – Raised when the model with the given model_uid cannot be retrieved. The error message provides details of the failure.
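The following is a minimal sketch of how the returned dictionary and the RuntimeError might be handled together. The helper `summarize_model`, the stand-in class `FakeClient`, the model UID "my-model", and all sample values are hypothetical and exist only so the example runs without a server. Only `describe_model`, its documented keys, and the RuntimeError behavior come from the reference above.

```python
from typing import Any, Dict, Optional


def summarize_model(client: Any, model_uid: str) -> Optional[str]:
    """Return a one-line summary of a model, or None if it cannot be described.

    `client` is any object exposing describe_model(model_uid), for example an
    xinference.client.Client instance. This helper is illustrative only and is
    not part of xinference.
    """
    try:
        info: Dict[str, Any] = client.describe_model(model_uid)
    except RuntimeError as exc:
        # describe_model reports lookup failures through RuntimeError.
        print(f"Could not describe {model_uid!r}: {exc}")
        return None

    # .get() with defaults is a defensive choice (assumption: some keys
    # may be absent depending on the model type).
    return "{} ({}, {}B, {}, context {})".format(
        info.get("model_name", "unknown"),
        info.get("model_type", "unknown"),
        info.get("model_size_in_billions", "?"),
        info.get("quantization", "?"),
        info.get("context_length", "?"),
    )


class FakeClient:
    """Hypothetical stand-in for a real client, with made-up sample values."""

    def describe_model(self, model_uid: str) -> Dict[str, Any]:
        if model_uid != "my-model":
            raise RuntimeError(f"Model not found: {model_uid}")
        return {
            "model_type": "LLM",
            "model_name": "example-llm",
            "model_size_in_billions": 7,
            "quantization": "q4_0",
            "context_length": 4096,
        }


if __name__ == "__main__":
    print(summarize_model(FakeClient(), "my-model"))
    print(summarize_model(FakeClient(), "missing"))
```

Against a running server, the stand-in would presumably be replaced by a real `Client` constructed with the server's endpoint URL, and `model_uid` by the UID of a model that has actually been launched.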