INDEX
Explanations
references to specific geographical locations, events, and groups
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1150
+0.12
0.4%
227
+0.12
0.3%
764
+0.09
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
227
+0.12
0.07
648
+0.12
0.04
113
+0.09
0.04
Negative Logits
middels
-0.81
écout
-0.78
télécharge
-0.74
évit
-0.73
considér
-0.72
regardant
-0.71
décid
-0.70
Mittwoch
-0.67
appréci
-0.67
Donnerstag
-0.65
POSITIVE LOGITS
Interpreting
0.51
errone
0.51
Evaluations
0.49
Causal
0.47
Occurrence
0.46
sophistic
0.46
Ltd
0.46
evoc
0.45
depic
0.45
Impaired
0.45
Activations Density 0.256%