INDEX
Explanations
phrases related to imprisonment or detainment
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
899
+0.13
0.4%
484
+0.11
0.3%
908
+0.11
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
802
+0.13
0.06
908
+0.11
0.04
899
+0.11
0.05
Negative Logits
vuol
-0.60
gouver
-0.57
gius
-0.56
ricor
-0.56
cresce
-0.55
Rhestr
-0.55
ьаж
-0.54
affez
-0.54
vernac
-0.53
IndentedString
-0.51
POSITIVE LOGITS
unspeak
0.97
gaily
0.91
unwarran
0.88
tolerably
0.86
ingrat
0.85
apprehen
0.85
indescri
0.83
wanderer
0.83
withal
0.83
outlander
0.80
Activations Density 0.318%