AI Engine

TinyCPUDevAI

Nicknamed "Tiny"

TinyCPUDevAI is the model and runtime layer underneath this project’s AI behavior. It is not just a prompt wrapper around somebody else’s hosted model. It is a custom MinGRU-based local system with recurrent hidden state, exportable inference backends, quantized runtime paths, retrieval-aware prompt assembly, shared blackboard coordination, and a training stack built to survive real-world iteration instead of demo-only happy paths.

Commission a Custom Build Contact Us

O(1)

Per-Step Recurrent Memory

Inference Backends

INT4

Weight-Only Fast Path

RAG

Workspace + Knowledge Retrieval

Shared Blackboard Workflow

100%

Local Runtime Control

Core Architecture

A Local AI Stack With Real Engineering Behind It

TinyCPUDevAI is impressive for the same reason Portal IDE is impressive: the capabilities are wired into code paths you can inspect. The model, runtime, retrieval, and training improvements here are not abstract aspirations. They are implementation details that directly change how the system behaves.

🧠

MinGRU Neural Engine

Built around the MinGRU update rule from "Were RNNs All We Needed?" so the recurrent core stays lean while preserving persistent hidden-state behavior. The implementation uses numerically stable log-space gate computation, which matters because it keeps the model usable over longer sequences and touchier training conditions.

⚡

Triton Fused CUDA Path

On NVIDIA hardware, Tiny can dispatch fused Triton kernels for MinGRU recurrence instead of staying in the plain Python/PyTorch loop. That gives the project a serious performance ceiling without making Triton a hard dependency, because the runtime still falls back safely when CUDA or Triton are unavailable.

🚀

Exportable Runtime Backends

Tiny is not locked to one execution path. The repo supports the default PyTorch runtime, ONNX Runtime export/load, and an OpenVINO path compiled from ONNX. That makes the model more deployable across different local hardware targets and gives the project room to optimize for the machine it is running on.

📈

Retrieval-Aware Prompt Assembly

Tiny does not rely on raw prompt text alone. The runtime can pull in workspace retrieval, knowledge RAG, memory recall, history context, diagnostics, selected text, and blackboard state before answering, which makes the system far more grounded than a stateless local text generator.

🔢

Quantization Without Throwing Away the Fallback

The MinGRU layers support per-group symmetric INT4 quantization while preserving the original floating-point weights for compatibility and fallback behavior. That is a practical design choice: Tiny can chase lower bandwidth and faster inference without making the model brittle to serialize, reload, or downgrade.

🔎

Embeddings and Self-Referential RAG

Tiny exposes embeddings and uses them in retrieval flows, including answering from its own code and local knowledge stores. When the model can cite its own implementation and active workspace instead of only leaning on generic priors, it becomes more useful as an engineering assistant and easier to trust during real debugging.

Interactive Explainer

Why MinGRU Fits Local AI Better

This comparison used to live in Labs. It belongs here now, because it explains the model tradeoff directly: TinyCPUDevAI is optimized for local-first reasoning where predictable recurrent state and realistic hardware behavior matter more than chasing a cloud-style attention budget.

What Changes Locally

✓ Recurrent state stays predictable as sessions get longer
✓ Local deployments map more honestly to real workstation constraints
✓ The architecture reinforces the privacy-first product story instead of fighting it

Why It Matters

✓ Easier to justify offline and private deployments
✓ Stronger answer to why this is not just a hosted wrapper
✓ Better fit for hardware ranges that real developers actually own

Inside Portal IDE

What Tiny Actually Powers

TinyCPUDevAI is the AI system underneath the product surface. The Portal IDE page owns demos and platform comparisons; this page stays focused on the intelligence layer itself and the specific places it shows up inside the IDE.

💬

Code Assistance and Refactoring

Tiny drives local code generation, file-aware edits, tool-assisted implementation loops, and repair suggestions. The important difference is that those outputs can be fed by real workspace evidence rather than a generic autocomplete-only context window.

⚖️

Draft and Synthesis Workflows

The reasoning flow supports staged drafting and refinement with a shared blackboard, memory bundle reuse, and stricter answer shaping rules. That means Tiny can participate in a real multi-pass workflow instead of pretending the first response is always the best one.

🎓

Planner and Goal-Loop Handoffs

Tiny is also part of the planner and goal-loop ecosystem: planning prompts, transient profile runs, and workspace-scoped task progression can all feed into the same local reasoning substrate rather than creating yet another disconnected automation layer.

🧠

Memory and Knowledge Retrieval

Persistent memory, history retrieval, workspace RAG, knowledge RAG, and self-indexed code understanding give Tiny a better shot at staying relevant to the active repo and the user’s recent work instead of answering from detached background knowledge alone.

See the full Portal IDE product surface

Reasoning Runtime

Deliberation With Shared Working Memory

Tiny’s reasoning story is stronger now because the runtime is more disciplined. The model can draft, refine, critique, and verify while sharing retrieval context and blackboard artifacts across stages.

⚖️

Structured Prompt Contract

The shared reasoning prompt contract enforces repo-grounded behavior, explicit uncertainty handling, plan-execute-verify rhythms, and reconciliation against logs and touched files. That matters because model quality is partly architecture and partly the discipline of the runtime wrapped around it.

🔮

Shared Blackboard

Draft code, test snippets, review notes, and candidate-ready outputs can be written to a shared blackboard so later stages do not need to re-invent the same context from scratch. It is a small idea with a big practical effect on coherence.

🧠

Tool and Evidence Routing

Tiny can operate with tool categories spanning generation, embeddings, local tools, kernel relays, retrieval, diagnostics, Git, patching, and validation support. The real power here is not “agent vibes,” it is that reasoning can pull from evidence and then act on the workspace with guardrails.

user request
  -> gather workspace + knowledge + memory context
  -> draft candidate path
  -> write intermediate artifacts to the shared blackboard
  -> critique against repo rules and evidence
  -> synthesize one action plan
  -> emit final answer with validation-aware rationale

Training Runtime

A Training Stack That Tries Hard Not to Waste Your Night

One of the biggest improvements in this repo is not only model quality, but training survivability. Tiny’s pipeline now does far more to expose progress, reuse work, isolate artifacts, and recover when startup or tokenization gets rough.

🔬

Pretokenization Planner

The pretokenization path now resolves its real execution mode up front, supports packed and non-packed flows, emits visible heartbeats, and has watchdog coverage for long stalls. That makes startup behavior far easier to observe and much less likely to disappear into silent dead time.

🩺

Token Cache Reuse

Verified token caches are content-addressed by corpus signature, tokenizer contents, sequence settings, and run nonce so unchanged corpora can be reused safely. That is one of the most practical upgrades in the project because it turns expensive preprocessing into an asset instead of a repeated tax.

🛡️

EWC and Checkpoint Retention

The training stack includes checkpointing, retention logic, and EWC-aware fine-tuning support so continued training can preserve more of what the model already knows instead of treating every run like a blind reset.

⚔️

Hardware-Aware Optimization

Training can detect bf16 support, auto-tune gradient-checkpoint segments, use mixed precision where appropriate, and shift strategy based on actual hardware capability instead of pretending every machine has identical tolerance for bigger runs.

🌟

Recovery-Oriented Launches

Bootstrap checkpoints, tokenizer bootstrap fallbacks, resume-policy controls, and monitor-managed watchdog behavior help runs recover more gracefully when artifacts are missing or startup paths drift out of alignment.

🌍

Training Monitor Visibility

The monitor now mirrors colored training output, shows variant-local state, tracks checkpoints and corpus options, exposes a no-resume path, and keeps long-running startup phases visible in one place. That makes Tiny easier to operate, not just easier to describe.

Knowledge and Retrieval

Context That Is Broader Than the Current Prompt

Tiny is much more than a next-token loop. The runtime now has multiple ways to gather context and keep it useful across sessions, repos, and training cycles.

Retrieval Architecture

✓ Workspace RAG for active source files and local project structure
✓ Knowledge RAG over curated local corpus files
✓ Memory recall bundles for short-form persistent conversational context
✓ History retrieval so past file changes can inform current repairs
✓ Embedding generation plus scoring-based fallback paths
✓ Web-assisted critique when local evidence alone is not enough

Corpus and Cache Discipline

✓ Corpus scanning with tagging, quality summaries, and cache signatures
✓ Automatic reuse of valid pretokenized caches across matching runs
✓ Incremental tokenization and compatibility checks to avoid stale reuse
✓ Packed-document startup support when it helps throughput
✓ Non-packed fallback paths when a giant startup window would stall
✓ Variant-local artifacts so one model family does not corrupt another's runtime state

Product Fit

Why This Page Stays Focused on the AI Layer

Platform demos, model explainers, and architecture proof now live across the Portal IDE and TinyCPUDevAI pages directly. This page stays concentrated on the neural architecture, reasoning stack, memory, and deployment model behind the AI itself.

🧪

Architecture Here

MinGRU internals, training behavior, reasoning layers, and knowledge lifecycle belong on this page because they describe the model and runtime directly.

🖥️

Platform Proof Elsewhere

Portal IDE owns the platform demo, validation proof, and product comparison surface so the same evidence is not repeated in multiple places.

🔗

One Connected Story

TinyCPUDevAI is not a separate marketing claim. It is the intelligence substrate that powers Portal IDE and custom offline deployments, with the explainer material now folded into the product pages that actually use it.

Open Portal IDE proof surface

Custom Builds

Completely Offline. Data-Protecting. High-Security.

TinyCPUDevAI is not available for general sale. What we offer instead is something more valuable: a bespoke, completely offline, data-protecting AI build — designed precisely for your organisation’s requirements, deployed entirely within your infrastructure, and guaranteed to never expose your data to any third party, network, or cloud service.

TinyCPUDevAI is not currently available for general sale.
We accept commissions for fully custom, completely offline, data-protecting, high-security AI builds for businesses and individuals with elevated privacy, security, or compliance requirements. Every custom build is delivered with full deployment support, documentation, and optional integration into Portal IDE or your existing development toolchain.

🏢

Domain-Specific Intelligence

Your custom TinyCPUDevAI is trained exclusively on your internal codebase, documentation, and proprietary frameworks. It learns your architecture patterns, naming conventions, API contracts, and domain vocabulary — not generic internet code. The result is an AI that responds like a senior member of your team.

🔒

Air-Gapped & On-Premise Deployment

Designed for complete network isolation. No telemetry, no cloud calls, no external API dependencies at runtime — ever. Deployable on fully air-gapped networks, secure government environments, regulated financial infrastructure, or any organisation where data sovereignty is non-negotiable.

⚙️

Tailored Integration & Delivery

Integrate into Portal IDE, deploy as a standalone service, or embed into your existing toolchain. We handle custom tool registries, bespoke training pipelines, tailored security policies, and full deployment documentation — all built to your exact specification with hands-on support.

Commission a Custom Build

Tell us about your organisation, your use case, your infrastructure requirements, and any compliance constraints. We will scope a completely offline, data-protecting TinyCPUDevAI solution built precisely for your needs.

✉ Enquire About a Custom Build

MythlogicStudios@outlook.com · Response within 24–48 hours

Interested in a Custom Build?

TinyCPUDevAI is not available for general sale — we build bespoke, completely offline, data-protecting AI solutions tailored to your specific requirements. Contact us to start the conversation.

✉ Enquire About a Custom Build

Also see: Portal IDE · Proof Surface · Monster Maneuvers