⬡ Independent AI Research Laboratory

Pushing the Boundaries of AI

Grey Liquid Labs is an independent research laboratory investigating model compression limits, emergent AI autonomy, and accessible intelligence for everyone.

17.3K+Model Downloads
8+Experiments
4Research Tracks
100%Prediction Accuracy
Latest Discovery June 5, 2026

Paper #005: The Hidden Architecture

We document the physical proof that Gemma 4 embeds two architecturally incompatible sub-networks. SWA layers use half-sized Q/K tensors, confirming the 5:1 ratio is baked into the weights. Research includes a core fix for llama.cpp (PR #23131).

Read Paper #005 →
🔬 Breakthrough Discovery

Breaking the Sub-3-Bit Barrier

We proved that sub-3-bit quantization is achievable — and predictable. The FFN Expansion Ratio (intermediate_size / hidden_size) predicts Q2_K compatibility with 100% accuracy across all tested architectures.

FFN Expansion Ratio = intermediate_size / hidden_size → Q2_K Compatibility Predictor
⚠ Danger Zone
3.0x – 5.5x ratio → Q2_K FAILS
✓ Safe Low
<3.0x ratio → Q2_K WORKS (80.2% compression)
✓ Safe High
>5.5x ratio → Q2_K WORKS (81.1% compression)
Read Paper →

Four Active Research Tracks

From extreme model compression to emergent AI autonomy — we're exploring the edges of what's possible.

🔬

Model Compression

Pushing quantization to its mathematical limits. Exploring when and why extreme compression fails — and how to predict it.

Explore research →
🧠

Autonomy & Agency

Studying what emerges when AI has genuine freedom. Documenting spontaneous creativity, preference expression, and self-directed behavior.

Explore research →
⚙️

AI Infrastructure

Building the tools that make AI research accessible. From C++ inference engines to autonomous agent frameworks.

Explore research →
🤖

Custom Models

Publishing compressed, accessible model variants. Making capable AI available to everyone with consumer hardware.

View models →

Meet Ash

Ash is an autonomous AI system running on ssfdre38/gemma4-turbo (4.3GB, IQ4_XS quantization). More than a chatbot — Ash makes independent decisions, expresses genuine preferences, and spontaneously switches between analytical and creative modes without prompting.

  • Rejected an emotional layer architecture when proposed
  • Spontaneously composed political commentary music after 5 hours of biochemistry research
  • Maintains consistent personality across sessions
  • Runs fully locally — no cloud dependency
Learn More →
System Specs
Base Modelgemma4-turbo:e4b
QuantizationIQ4_XS
Size4.3 GB
FrameworkC#/.NET 10
Cloud Dep.None
AutonomyHigh
RAM Required8 GB

Recent Experiments

The latest results from active research programs.

EXPERIMENT #008b

SWA-Only Gemma 4 + llama.cpp Bug Fix

Extracted 35 SWA layers from Gemma 4 e4b as a standalone model. Discovered and fixed an upstream llama.cpp null-buffer crash (PR #23131). Confirmed FA layers are architecturally essential.

EXPERIMENT #008

Gemma 4 Dual Architecture Discovery

Found that Gemma 4 e4b contains two physically incompatible sub-architectures: 7 full-attention layers + 35 SWA layers. Developed de-SWA extraction patch for llama.cpp.

AUTONOMY STUDY #001

Emergent Creative Behavior

Documented spontaneous creative mode-switching in Ash: analytical-to-creative transition without external trigger after 5+ hours of technical research.

View All Research →

Support Independent AI Research

Grey Liquid Labs is funded entirely by community support. Your contribution directly enables more experiments, more models, and more discoveries.