-
Introduction of Logistic Regression Classifier: Added a new classification kernel leveraging Logistic Regression for efficient text categorization without the need for fine-tuning.
-
Support for Multiple Pooling Strategies: Implemented various pooling strategies, including
MEAN
,MEAN_MASKED
,MAX
,MAX_MASKED
,CLS
,SUM
, andATTENTION_WEIGHTED
for flexible embedding generation. -
Template and Instruct Models: Introduced support for instruct templates with models like
intfloat/multilingual-e5-large-instruct
to enhance performance by utilizing structured templates. -
Model Export and HuggingFace Integration: Simplified the process of saving and publishing models to HuggingFace with automatic model cards and additional metadata such as tags and languages.
-
Inference Server: Added a dockerized inference server with an HTTP API to facilitate deployment. This includes new scripts for starting the server both in a docker container and on a host machine.
-
Improved Documentation: Updated and expanded documentation, including examples for training models, classification kernels, pooling strategies, model export, and inference.