You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
If your model is not in the above list, we will try to automatically convert the model using
738
746
{func}`~vllm.model_executor.models.adapters.as_classification_model`. By default, the class probabilities are extracted from the softmaxed hidden state corresponding to the last token.
739
747
740
-
#### Sentence Pair Scoring (`--task score`)
748
+
#### Sentence Pair Scoring
749
+
750
+
Specified using `--task score`.
741
751
742
752
:::{list-table}
743
753
:widths: 25 25 50 5 5
@@ -824,7 +834,9 @@ vLLM currently only supports adding LoRA to the language backbone of multimodal
824
834
825
835
See [this page](#generative-models) for more information on how to use generative models.
826
836
827
-
#### Text Generation (`--task generate`)
837
+
#### Text Generation
838
+
839
+
Specified using `--task generate`.
828
840
829
841
:::{list-table}
830
842
:widths: 25 25 15 20 5 5 5
@@ -1200,7 +1212,9 @@ Since some model architectures support both generative and pooling tasks,
1200
1212
you should explicitly specify the task type to ensure that the model is used in pooling mode instead of generative mode.
1201
1213
:::
1202
1214
1203
-
#### Text Embedding (`--task embed`)
1215
+
#### Text Embedding
1216
+
1217
+
Specified using `--task embed`.
1204
1218
1205
1219
Any text generation model can be converted into an embedding model by passing `--task embed`.
1206
1220
@@ -1240,7 +1254,9 @@ The following table lists those that are tested in vLLM.
1240
1254
* ✅︎
1241
1255
:::
1242
1256
1243
-
#### Transcription (`--task transcription`)
1257
+
#### Transcription
1258
+
1259
+
Specified using `--task transcription`.
1244
1260
1245
1261
Speech2Text models trained specifically for Automatic Speech Recognition.
0 commit comments