INDEX
Explanations
family-related words and phrases
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
394
+0.08
0.2%
612
+0.07
0.2%
1967
+0.07
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
862
+0.08
0.02
513
+0.07
0.03
1093
+0.07
0.04
Negative Logits
idr
-1.06
reluct
-1.04
„,
-1.04
maneu
-1.04
kask
-1.02
gmbh
-1.00
effe
-0.99
mef
-0.97
wien
-0.97
socie
-0.97
POSITIVE LOGITS
killed
0.57
attending
0.54
registered
0.52
enrolled
0.52
involved
0.51
abroad
0.50
who
0.49
whom
0.49
deceased
0.48
тоже
0.47
Activations Density 0.362%