// L04 — FOUNDATION MODELS

A continuum of intelligence.
Open knowledge, refined bridges,
original innovation.

Three model lines that work together. Lexicons — curated open-source models, quantized for enterprise. Nexons — open foundations enhanced with our own datasets for sharper performance, trust, and relevance. Shakti — a fully in-house family of small and mid language models, built ground-up for edge and enterprise AI.

Shakti models released: 6 · Largest in flight: 30B · arXiv papers: 3 · Patent filings: PCT

From open knowledge to original innovation.

Most enterprises don't need a single model — they need the right model for the workload, with a path to upgrade as their data matures. Lexicons, Nexons, and Shakti are the three rungs of that ladder: from accessible open-source baselines, through enhanced fine-tunes, to fully in-house frontier work.

[ PILLAR 01 ]

Lexicons

Open knowledge.

Curated open-source models, quantized and made accessible for enterprises and developers. Permissive licenses. HuggingFace-hosted. Drop-in via the EdgeMatrix runtime.

[ PILLAR 02 ]

Nexons

Refined bridges.

Enhanced models — strong open foundations fine-tuned with our datasets to bring sharper performance, trust, and relevance. First Nexon releasing soon.

[ PILLAR 03 ]

Shakti

Original innovation.

A fully in-house family of small and mid language models, built ground-up for edge and enterprise AI. Six released (100M – 4B). Two in flight (8B, 30B). Three arXiv papers.

"From open knowledge, to refined bridges, to original innovation. The vision: make AI accessible, reliable, and meaningful — whether it runs on the cloud, on-premise, or at the edge."

— Kamalakar Devaki, Founder · CEO, SandLogic Technologies

Open-source models, made enterprise-ready.

Lexicons is our growing zoo of curated open-source foundation models — quantized, packaged, and benchmarked for enterprise deployment. Permissive licenses on HuggingFace and GitHub. Quick customization, minimal retraining, full transparency.

Curated

We benchmark every release before it ships. Models that don't hold up don't make the catalog.

Quantized

Q4_KM, Q5_KM, and Q8 variants where they meaningfully reduce footprint. Same quality bar, smaller binary.

Permissive

Apache 2.0 / MIT / OpenRAIL where the upstream license allows. No surprise restrictions.

Runtime-ready

Every Lexicon ships with a manifest the EdgeMatrix runtime understands — load and serve in one line.

$ pip install sandlogic-lexicons

# Browse the catalog
from lexicons import catalog
print(catalog.list(domain="finance"))

# Load any model
from lexicons import load
model = load("shakti-2.5b-q4km")

# Or run a vision-language model via the EdgeMatrix runtime
from edgematrix import Runtime
rt = Runtime("shakti-vlm-4b", device="cuda")
prompt = "List the key fields in this document."  # example inputs
img = "invoice.png"
output = rt.generate(prompt, image=img)
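
Each catalog entry carries the manifest mentioned above. For illustration only, a minimal sketch of what such a manifest could contain; every field name here is an assumption, not the actual EdgeMatrix schema:

# Hypothetical Lexicon manifest, shown as a Python dict.
# Field names are illustrative assumptions, not the real schema.
MANIFEST = {
    "name": "shakti-2.5b-q4km",
    "family": "shakti",
    "parameters": "2.5B",
    "quantization": "Q4_KM",
    "license": "Apache-2.0",
    "devices": ["cpu", "cuda"],
    "benchmarks": {"mmlu_5shot": 69.2},
}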

Open foundations, sharpened with our data.

Nexons take strong open foundations and fine-tune them with SandLogic's proprietary datasets for sharper performance, trust, and relevance on enterprise workloads. They are the bridge between the open ecosystem and the in-house Shakti family — built for teams that want better-than-baseline accuracy without committing to a fully proprietary stack.

Sharper performance

Targeted fine-tunes on enterprise corpora — telephony, contracts, claims, code-switched calls. Higher accuracy on the workloads our customers actually run.

Trust by construction

Trained alongside HaluMon evaluation. Hallucination rates measured before release. Confidence calibration tuned for regulated deployment.

Domain relevance

Indic languages, BFSI compliance vocabulary, healthcare clinical terminology. The vocabulary your customers expect, not what generic instruction-tuning produces.

[ COMING SOON ]

First Nexon releases shortly.

The first Nexon is in late-stage evaluation now. Sign up to be notified when it ships — or talk to us if you want a Nexon trained on your domain corpus before the public release.

One family. Every parameter range.

Shakti is our fully in-house family of small and mid language models — built ground-up for edge and enterprise AI. Six released (100M to 4B parameters), two in flight (8B and 30B). Pick the smallest model that meets your accuracy bar — Shakti models are tuned to outperform peers 2–3× their size, so the right deployment is almost always smaller than you think.

Model         | Params | Target deployment                            | Highlight
Shakti-100M   | 100M   | Always-on wearables, IoT, ultra-low-power    | Optimized for IoT command grammars
Shakti-250M   | 250M   | Smartwatch, hearables, mid-tier embedded     | Outperforms SmolLM-135M on ANLI & PIQA
Shakti-500M   | 500M   | Smart home, voice assistants, edge inference | Optimized RLHF + DPO alignment
Shakti-1B     | 1B     | Document intelligence, OCR, multimodal       | OCRBench: 798 · Beats InternVL2-1B, MiniCPM-V-2
Shakti-2.5B   | 2.5B   | Enterprise agents, contact center, BFSI      | MMLU 69.2 (Q4_KM) · Beats Phi-3.5-Mini · arXiv published
Shakti-4B-VLM | 4B     | High-end document VLM, multimodal reasoning  | DocVQA 92.92 · Beats InternVL2-4B, Phi-3-Vision
Shakti-8B     | 8B     | Scientific research, comprehensive analysis  | In flight — extended context, domain-specific
Shakti-30B    | 30B    | Frontier reasoning at the edge of efficiency | In flight — H2 2026
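
"Pick the smallest model that meets your accuracy bar" can also be expressed in code. A hedged sketch against the catalog API shown earlier; the family, mmlu, params_b, and name fields are illustrative assumptions:

from lexicons import catalog, load

# Hypothetical: keep Shakti models that clear a 65.0 MMLU bar,
# then take the smallest by parameter count. Field names are assumptions.
candidates = [m for m in catalog.list(family="shakti") if m.mmlu >= 65.0]
best = min(candidates, key=lambda m: m.params_b)
model = load(best.name)  # e.g. "shakti-2.5b-q4km"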
// PARAMETER SCALE

From wearables to frontier — log-scaled.

[ Chart: Shakti family by parameter count, Shakti-100M through Shakti-30B · legend: Shipped / In flight ]
Bar height represents log-scaled parameter count. Hatched bars are models in flight (8B, 30B).
// SHAKTI-2.5B vs PEERS

3× smaller.
Match for match.

Shakti-2.5B (Q4_KM) benchmarked against Llama 3 8B and Phi-3.5-Mini. Shakti leads on every benchmark shown.

Benchmark           | Shakti-2.5B (Q4_KM) | Peer
MMLU (5-shot)       | 69.2                | Llama 3 8B: 58.1
PIQA (5-shot)       | 84.7                | Phi-3.5-Mini: 78.0
WinoGrande (5-shot) | 68.1                | Phi-3.5-Mini: 65.1
SocialQA            | 78.1                | Llama 3 8B: 71.4
MedQA               | 57.1                | Phi-3.5-Mini: 47.4
TruthfulQA (MC2)    | 61.2                | Llama 3 8B: 55.1
// SHAKTI-2.5B vs LLAMA 3 8B (selected benchmarks)

MMLU · SocialQA · TruthfulQA — head to head.

MMLU (5-shot): Shakti-2.5B (Q4_KM) 69.2 · Llama 3 8B 58.1
SocialQA: Shakti-2.5B 78.1 · Llama 3 8B 71.4
TruthfulQA (MC2): Shakti-2.5B 61.2 · Llama 3 8B 55.1
Higher is better. Methodology in arXiv 2410.11331.

Source: Shakti-2.5B technical report — arXiv 2410.11331

Vision-language at a fraction of the size.

Shakti-VLM-4B uses QK-normalization and hybrid normalization (Pre-LayerNorm in early layers, Post-LayerNorm with RMSNorm in later ones). Despite using significantly fewer training tokens, it beats Qwen2VL-7B and MiniCPM-V-2.6-8B on document and chart understanding.
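
For intuition, a minimal PyTorch sketch of the QK-normalization idea: queries and keys are normalized per head before the attention dot product, which keeps logit magnitudes bounded. This is an illustrative sketch under assumed dimensions and norm placement, not the Shakti-VLM implementation.

import torch
import torch.nn as nn

class QKNormAttention(nn.Module):
    # Illustrative QK-normalized self-attention; not SandLogic's code.
    def __init__(self, dim: int, n_heads: int):
        super().__init__()
        self.h, self.hd = n_heads, dim // n_heads
        self.qkv = nn.Linear(dim, 3 * dim, bias=False)
        self.q_norm = nn.LayerNorm(self.hd)  # normalize queries per head
        self.k_norm = nn.LayerNorm(self.hd)  # normalize keys per head
        self.out = nn.Linear(dim, dim, bias=False)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, t, d = x.shape
        q, k, v = self.qkv(x).chunk(3, dim=-1)
        q = self.q_norm(q.view(b, t, self.h, self.hd)).transpose(1, 2)
        k = self.k_norm(k.view(b, t, self.h, self.hd)).transpose(1, 2)
        v = v.view(b, t, self.h, self.hd).transpose(1, 2)
        # Normalized q/k keep attention logits bounded, stabilizing softmax.
        attn = torch.softmax(q @ k.transpose(-2, -1) / self.hd ** 0.5, dim=-1)
        y = (attn @ v).transpose(1, 2).reshape(b, t, d)
        return self.out(y)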

Benchmark     | Shakti-VLM-4B | Peer
MMMU (val)    | 59.78         | Phi-3-Vision: 46.1
DocVQA (test) | 92.92         | InternVL2-4B: 89.2
OCRBench      | 849           | Phi-3-Vision: 639
TextVQA (val) | 85.56         | InternVL2-4B: 74.4
// SHAKTI-VLM-4B vs PEERS

Document understanding — at 4B parameters.

DocVQA (test): Shakti-VLM-4B 92.92 · InternVL2-4B 89.2
TextVQA (val): Shakti-VLM-4B 85.56 · InternVL2-4B 74.4
MMMU (val): Shakti-VLM-4B 59.78 · Phi-3-Vision 46.1
Higher is better. Source: Shakti-VLM technical report — arXiv 2502.17092.

QK-Normalization

Normalizing queries and keys before the attention dot product improves stability and convergence.

Hybrid normalization

Pre-LayerNorm early, Post-LayerNorm with RMSNorm later — optimal stability/efficiency balance (see the sketch below).

Three-stage training

Pre-train, alignment, fine-tune. Lower token budget. Better task generalization.
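
Putting the hybrid-normalization layout in code: Pre-LayerNorm blocks in early layers, post-norm blocks with RMSNorm later. The block internals and the switch point below are illustrative assumptions, not the published configuration.

import torch
import torch.nn as nn

class RMSNorm(nn.Module):
    def __init__(self, dim: int, eps: float = 1e-6):
        super().__init__()
        self.weight = nn.Parameter(torch.ones(dim))
        self.eps = eps

    def forward(self, x):
        return self.weight * x * torch.rsqrt(x.pow(2).mean(-1, keepdim=True) + self.eps)

class PreLNBlock(nn.Module):
    # Pre-LayerNorm: normalize before the sublayer; stable early training.
    def __init__(self, dim: int):
        super().__init__()
        self.norm, self.ff = nn.LayerNorm(dim), nn.Linear(dim, dim)

    def forward(self, x):
        return x + self.ff(self.norm(x))

class PostRMSBlock(nn.Module):
    # Post-norm with RMSNorm: normalize after the residual add.
    def __init__(self, dim: int):
        super().__init__()
        self.norm, self.ff = RMSNorm(dim), nn.Linear(dim, dim)

    def forward(self, x):
        return self.norm(x + self.ff(x))

def hybrid_stack(n_layers: int, dim: int, switch: int) -> nn.Sequential:
    # Pre-LN for layers [0, switch); Post-RMSNorm after. The switch point is illustrative.
    return nn.Sequential(*(PreLNBlock(dim) if i < switch else PostRMSBlock(dim)
                           for i in range(n_layers)))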

Source: Shakti-VLM technical report — arXiv 2502.17092

// LET'S BUILD

Pick the model that fits your problem.