Model Gallery

4 models from 1 repository

insightface-opencv
Face recognition using OpenCV Zoo weights: YuNet detector + SFace 128-d recognizer (fp32). Apache 2.0, safe for commercial use. Lower accuracy than the insightface packs and no demographic head, so `/v1/face/analyze` returns detection regions only. Weights (~40MB) are downloaded on install via LocalAI's gallery mechanism.

Repository: localai | License: apache-2.0

insightface-opencv-int8
Int8-quantized OpenCV Zoo face pair (YuNet int8 + SFace int8, ~12MB). Roughly 3x smaller and noticeably faster on CPU than the fp32 variant, at comparable accuracy for face tasks. Apache 2.0, commercial-safe. Weights are downloaded on install via LocalAI's gallery mechanism.

Repository: localai | License: apache-2.0

speechbrain-ecapa-tdnn
Speaker (voice) recognition with SpeechBrain's ECAPA-TDNN trained on VoxCeleb. 192-d L2-normalised embeddings, ~1.9% Equal Error Rate on VoxCeleb1-O. Apache 2.0, commercial-safe. The checkpoint is auto-downloaded from Hugging Face on first LoadModel, so the gallery `files:` list carries no separate weight file. The entry points directly at the upstream SpeechBrain HF repo, so every deployment fetches the same bytes.

Repository: localai | License: apache-2.0
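
Speaker embeddings such as the 192-d L2-normalised vectors described above are usually compared with cosine similarity, which for unit-length vectors is just the dot product. The following is a minimal sketch of that comparison in plain Python, not LocalAI's or SpeechBrain's own code. The function names and the `threshold` value are hypothetical; in practice a threshold would be tuned on held-out data.

```python
import math


def l2_normalise(vec):
    """Scale a vector to unit length (no-op for already-normalised input)."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        raise ValueError("cannot normalise a zero vector")
    return [x / norm for x in vec]


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length embeddings, in [-1, 1]."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same dimensionality")
    a_n, b_n = l2_normalise(a), l2_normalise(b)
    # For unit vectors the dot product equals the cosine of the angle.
    return sum(x * y for x, y in zip(a_n, b_n))


def same_speaker(a, b, threshold=0.25):
    """Hypothetical decision rule: the threshold is an assumed placeholder."""
    return cosine_similarity(a, b) >= threshold
```

Embeddings from different models (for example 192-d and 256-d vectors) are not comparable with each other; the length check above guards against mixing them by accident.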

wespeaker-resnet34
Speaker recognition with WeSpeaker's ResNet34 trained on VoxCeleb, exported to ONNX. 256-d embeddings, CPU-friendly: it needs only onnxruntime and avoids the PyTorch runtime entirely. Apache 2.0. Pair with the `speaker-recognition` backend's OnnxDirectEngine. Use when ECAPA-TDNN's torch dependency is undesirable (small images, edge deployments).

Repository: localai | License: apache-2.0