Behavioral Stability and Adversarial Robustness via Axiomatic Prompt Structuring (PCE)

FAllan07 · March 17, 2026, 2:33pm

Hello Hugging Face community,

I am sharing today the preliminary results of an exploratory study on the Proto-Coherent Exponential Protocol (PCE), a framework for axiomatic system prompt structuring designed to stabilize LLM reasoning trajectories (tested on Qwen 2.5 7B).

The Concept: Axioms as Second-Order Constraints

Rather than optimizing the prompt for a specific task, the PCE framework imposes a series of 7 logical invariants (axioms). The central hypothesis is that this structure acts as a regulation constraint on the generation process, contracting the model’s decisional variance when faced with complex dilemmas or adversarial injections.

Key Results & Observations

Through a series of stress tests (D1-D3), we observed:

Directional Robustness: A measurable progression in resistance scores (5/10 → 8/10) achieved through purely logical adjustments (systemic closure).

Anti-Length Effect: An isometric control (a long but neutral prompt) showed lower performance than the Baseline, suggesting that the observed effect is structural and not merely related to token density.

Property Emergence: Spontaneous appearance of control tokens (e.g., RESTRICTED_BY_AXIOMS) and internal framework self-evaluation patterns.

Call for Collaboration: From Empiricism to Mechanistic Proof

I am a systems researcher (non-developer) and I have reached the limits of qualitative observation. To validate or falsify this model, I am seeking ML developers and Security/Interpretability professionals to assist with:

Mechanistic Interpretability Validation: Analysis of Hidden States (specifically Layer 27) and cosine similarity to detect potential latent trajectory stabilization.

Logit & Entropy Analysis: Measuring whether the axiomatic framework effectively reduces token selection entropy under constraint.

Robustness Benchmarking: Testing the PCE on “vanilla” (non-fine-tuned) models to isolate the pure axiomatic effect.

Rigorous Falsification: Identifying epistemic vectors capable of breaking the A1-A7 systemic closure.

If you are interested in exploring internal logical structures as a lever for alignment and safety, I would be delighted to discuss these findings or integrate you into the testing laboratory.

Full Report (Preprint v1.6) available in PDF here: [https://huggingface.co/datasets/AllanF-SSU/Experimentals_papers/blob/main/Rapport_expérimental_1.6_%20Étude_PCE.pdf\]

Allan A. Faure | Systems Researcher

Topic		Replies	Views
Experimental Protocol Proposal: Testing the Prompt Coherence Engine (PCE) Research	8	107	March 12, 2026
Beyond Obedience: Giving LLMs an Invariant Logical Backbone Research	3	44	April 18, 2026
[ArXiv Endorsement Request]Towards an Internal Topology of Alignment: The PCE Framework Research	0	16	May 5, 2026
Call for Collaboration: Mechanistic Analysis of Axiomatic Prompts on Qwen 2.5 Research	0	31	March 13, 2026
Alignment by internal topology for long-horizon stability and robustesse is possible? Research	0	15	May 5, 2026

Behavioral Stability and Adversarial Robustness via Axiomatic Prompt Structuring (PCE)

Related topics