INDEX

Explanations

Saving Survival

The neuron fires strongly on instruction‐style action words—especially single verbs that kick off or headline steps (e.g. “changing,” “decompiling,” “rip,” “lean away,” “filter,” “linking,” “vibe”)—marking the start of procedural or how-to directions.

New Auto-Interp

Configuration

Prompts (Dashboard)

392,802 prompts, 256 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

ir

0.82

ST

0.76

0.73

un

0.72

0.71

 Tunis

0.70

 Nella

0.69

IST

0.68

 Nelson

0.68

al

0.68

POSITIVE LOGITS

 oxidizing

0.89

 lojas

0.87

 wynik

0.86

দিগকে

0.85

 ګټ

0.84

ты

0.83

єд

0.83

চ্ছিলাম

0.83

 사항

0.82

 połą

0.82

Activations Density 0.000%