INDEX
Explanations
architecture
The neuron fires on mentions of tourist-attraction terms, especially “architecture” and “landmarks.”
Negative Logits
robotic     -0.07
Serv        -0.06
mHandler    -0.06
solving     -0.06
logging     -0.06
.Flow       -0.06
filthy      -0.06
bots        -0.06
pending     -0.06
Bowman      -0.06
Positive Logits
.ip         0.07
ичні        0.07
δή          0.06
$(          0.06
Secrets     0.06
OW          0.06
ดาว         0.06
SHALL       0.06
VIEW        0.06
《          0.06
Activations Density: 0.009%