LocalAI - Models

insightface-opencv

Face recognition using OpenCV Zoo weights: the YuNet detector plus the SFace 128-d recognizer (fp32). Licensed under Apache 2.0, so safe for commercial use. Accuracy is lower than the insightface packs, and there is no demographic head, so `/v1/face/analyze` returns detection regions only. Weights (~40MB) are downloaded on install via LocalAI's gallery mechanism.
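Recognizer embeddings such as the 128-d vectors mentioned above are usually compared with cosine similarity. The following is a minimal sketch of that comparison in plain Python. The function names and the `threshold` default are hypothetical placeholders, not part of LocalAI or OpenCV, and the threshold must be tuned on your own data.

```python
import math
from typing import Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two embeddings of equal dimension."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("embeddings must be non-zero")
    return dot / (norm_a * norm_b)


def same_face(a: Sequence[float], b: Sequence[float], threshold: float = 0.4) -> bool:
    """1:1 verification. The default threshold is an assumed placeholder."""
    return cosine_similarity(a, b) >= threshold
```

Raising the threshold trades false accepts for false rejects, so the right value depends on the application.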

Links

https://github.com/opencv/opencv_zoo

insightface-opencv-int8

Int8-quantized OpenCV Zoo face pair (YuNet int8 + SFace int8, ~12MB). Roughly 3x smaller than the fp32 variant (~40MB) and noticeably faster on CPU, at comparable accuracy for face tasks. Licensed under Apache 2.0, so safe for commercial use. Weights are downloaded on install via LocalAI's gallery mechanism.
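To illustrate where the size saving comes from, here is a minimal sketch of symmetric per-tensor int8 quantization. It is a generic textbook scheme on made-up weights, not necessarily the scheme used for the OpenCV Zoo files, and all names in it are hypothetical.

```python
from array import array


def quantize_int8(weights):
    """Symmetric quantization: w is approximated by scale * q, q in [-127, 127]."""
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    q = array("b", (max(-127, min(127, round(w / scale))) for w in weights))
    return q, scale


def dequantize(q, scale):
    """Map int8 codes back to approximate float weights."""
    return [scale * v for v in q]


# Made-up weights in [-0.5, 0.48], for illustration only.
weights = [0.02 * (i % 50) - 0.5 for i in range(1000)]
fp32 = array("f", weights)
q, scale = quantize_int8(weights)

fp32_bytes = len(fp32) * fp32.itemsize  # 4 bytes per weight
int8_bytes = len(q) * q.itemsize        # 1 byte per weight
max_error = max(abs(w - d) for w, d in zip(weights, dequantize(q, scale)))
```

A raw weight tensor shrinks 4x under this scheme, with a rounding error of at most half a quantization step. Whole-file ratios can be lower, which is consistent with the roughly 3x figure quoted above.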

Links

https://github.com/opencv/opencv_zoo

speechbrain-ecapa-tdnn

Speaker (voice) recognition with SpeechBrain's ECAPA-TDNN trained on VoxCeleb. Produces 192-d L2-normalised embeddings with ~1.9% Equal Error Rate on VoxCeleb1-O. Licensed under Apache 2.0, so safe for commercial use. The checkpoint is downloaded automatically from HuggingFace on the first LoadModel call, so the gallery entry's `files:` list contains no separate weight file. The entry points directly at the upstream SpeechBrain HF repo, so every deployment fetches the same bytes.
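The Equal Error Rate quoted above is the operating point where the false accept rate equals the false reject rate. A minimal sketch of estimating it from two lists of similarity scores follows. The function name and the toy scores are hypothetical, and real evaluations use the official VoxCeleb1-O trial list rather than a handful of values.

```python
def equal_error_rate(genuine, impostor):
    """Return (eer, threshold), sweeping every observed score as a threshold.

    genuine:  scores for same-speaker pairs
    impostor: scores for different-speaker pairs
    """
    best = None
    for t in sorted(set(genuine) | set(impostor)):
        far = sum(s >= t for s in impostor) / len(impostor)  # false accepts
        frr = sum(s < t for s in genuine) / len(genuine)     # false rejects
        gap = abs(far - frr)
        if best is None or gap < best[0]:
            best = (gap, (far + frr) / 2, t)
    return best[1], best[2]


# Toy scores, for illustration only.
eer, threshold = equal_error_rate([0.9, 0.8, 0.7, 0.3], [0.1, 0.2, 0.4, 0.6])
```

With finite data the two rates rarely cross exactly, so this sketch reports the midpoint at the threshold where they are closest.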

wespeaker-resnet34

Speaker recognition with WeSpeaker's ResNet34 trained on VoxCeleb, exported to ONNX. Produces 256-d embeddings and is CPU-friendly: it needs only onnxruntime and avoids the PyTorch runtime entirely. Licensed under Apache 2.0. Pair it with the `speaker-recognition` backend's OnnxDirectEngine. Use it when ECAPA-TDNN's torch dependency is undesirable (small images, edge deployments).
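Once embeddings are available, identifying a speaker against a set of enrolled voices is a nearest-neighbour search. The sketch below assumes the embeddings have already been extracted and normalises them explicitly, since the description above does not say whether the 256-d vectors are L2-normalised. The names `l2_normalize`, `identify`, and the `threshold` default are hypothetical and not part of the `speaker-recognition` backend.

```python
import math


def l2_normalize(v):
    """Scale a vector to unit length so dot products equal cosine similarity."""
    norm = math.sqrt(sum(x * x for x in v))
    if norm == 0.0:
        raise ValueError("cannot normalise a zero vector")
    return [x / norm for x in v]


def identify(probe, enrolled, threshold=0.5):
    """1:N identification over a dict of name -> embedding.

    Returns (name, score) for the best match, or (None, score) when the
    best score is below the assumed placeholder threshold.
    """
    p = l2_normalize(probe)
    best_name, best_score = None, -1.0
    for name, emb in enrolled.items():
        score = sum(a * b for a, b in zip(p, l2_normalize(emb)))
        if score > best_score:
            best_name, best_score = name, score
    if best_score < threshold:
        return None, best_score
    return best_name, best_score
```

For large enrolment sets, normalising the stored embeddings once at enrolment time avoids repeating that work on every query.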

Links

https://github.com/wenet-e2e/wespeaker
