Vigilio — Independent research on agentic AI security and system evolution

Toward AI systems that are secure, frugal, and governable as they evolve.

Vigilio is an independent research practice studying the control plane of agentic AI: how to constrain autonomous agents under adversarial conditions, how experience-driven memory can replace brute-force token spend, and how meta-control frameworks can keep self-modifying systems aligned over time.

Founded

2025 · Taiwan

Practice

Independent research lab

Focus

Agentic AI security · Evolutionary control

Lineage

UC Berkeley · OWASP · Armorize / Proofpoint

RA · 01

Agentic AI Security Control

Threat Modeling Adversarial ML Tool-use Containment

Today's agentic systems combine LLM reasoning, tool use, persistent memory, and multi-agent delegation — a surface area that traditional AppSec was never designed to cover. Our work formalises control objectives for agent runtimes: blast-radius bounds for tool-calls, integrity guarantees on memory, and attestation across agent-to-agent hand-offs.

Current investigations include defenses against prompt-injection chains via RAG, model inversion in fine-tuned domain models, data poisoning in continual-learning pipelines, and evasion attacks on inference platforms (vLLM, Hugging Face, Slurm-scheduled jobs).

RA · 02

Experience-Driven Token Economy

Memory Architectures Inference Cost Continual Learning

LLM cost today scales with context length; intelligence does not. We study how to replace recomputation with accumulated experience: structured episodic memory, distilled procedural skills, and verifiable retrieval that lets an agent answer a recurring class of questions without re-paying for the reasoning each time.

The goal is a measurable reduction in tokens-per-decision on production workloads, without sacrificing factuality — a prerequisite for any agentic system that must run continuously inside an enterprise budget.

RA · 03

Meta-Control of AI System Evolution

Governance Self-Modifying Systems Alignment

Modern AI stacks rewrite themselves: weights are updated, prompts are rewritten by other prompts, agents spawn agents. Single-layer policy is insufficient. We are developing a meta-control approach that treats the AI system itself as the object of governance — explicit invariants on the trajectory of change, with audit trails that survive across model swaps and platform migrations.

The framing draws on classical control theory, software supply-chain integrity, and threat modeling adapted for systems whose own behaviour is the deployment artefact.

Toward AI systems that are secure, frugal, and governable as they evolve.

Three threads of work, one underlying question:
how do we keep increasingly autonomous AI systems safe, efficient, and answerable?

Agentic AI Security Control

Experience-Driven Token Economy

Meta-Control of AI System Evolution

From theory to
operational control.

Research practice over product roadmap.

Threat models before tooling.

Independence over scale.

End-to-end, not slide-deep.

Working notes, technical disclosures, and prior art.

Lab of one, work of many years.

Considered correspondence is welcomed.

Three threads of work, one underlying question:how do we keep increasingly autonomous AI systems safe, efficient, and answerable?

Agentic AI Security Control

Experience-Driven Token Economy

Meta-Control of AI System Evolution

From theory tooperational control.

Research practice over product roadmap.

Threat models before tooling.

Independence over scale.

End-to-end, not slide-deep.

Working notes, technical disclosures, and prior art.

Lab of one, work of many years.

Considered correspondence is welcomed.

Three threads of work, one underlying question:
how do we keep increasingly autonomous AI systems safe, efficient, and answerable?

From theory to
operational control.