INDEX
Explanations
phrases related to rules and procedures
Neuron Alignment
Index    Value    % of L₁
674      +0.11    0.3%
1253     +0.07    0.2%
1235     +0.07    0.2%
Correlated Neurons
Index    P. Corr.    Cos Sim.
1919     +0.11       0.04
62       +0.07       0.03
526      +0.07       0.03
Negative Logits
BIBSYS    -0.83
uhr       -0.83
maksi     -0.79
meis      -0.76
isoli     -0.75
gmbh      -0.75
lemp      -0.74
levis     -0.73
akku      -0.72
liev      -0.68
Positive Logits
ought       0.43
übri        0.42
deserve     0.42
probably    0.42
demek       0.41
probably    0.41
likely      0.40
значит      0.40
raccont     0.40
verdient    0.39
Activations Density: 0.309%