INDEX
Explanations
punctuation
The neuron detects uppercase initialisms or acronyms (multi-letter all-caps abbreviations).
New Auto-Interp
Negative Logits
Northwestern
-0.07
=================================
-0.07
shifted
-0.07
Cooke
-0.07
Fra
-0.07
//////
-0.06
Covenant
-0.06
commas
-0.06
attacked
-0.06
prolonged
-0.06
POSITIVE LOGITS
prm
0.06
��
0.06
)p
0.06
ýš
0.06
önemlidir
0.06
oxide
0.06
politic
0.06
.IsAny
0.06
SuppressLint
0.06
0.06
Activations Density 0.018%