INDEX
Explanations
references to measurements or quantities, especially expressed in metric terms like millimeters or milliliters
Neuron Alignment
Index   Value   % of L₁
1937    +0.20   1.0%
1671    +0.19   1.0%
1482    +0.17   0.9%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1343    +0.20      0.11
678     +0.19      0.10
1577    +0.17      0.10
Negative Logits
earnestness   -0.77
rehensive     -0.73
<bos>         -0.73
impelled      -0.71
endeavored    -0.70
Mmmm          -0.70
ardor         -0.68
Bullshit      -0.68
Hahahahaha    -0.67
Derp          -0.66
Positive Logits
klassi      0.92
kompati     0.89
katastro    0.84
rafraî      0.83
flé         0.82
iI          0.80
Bakter      0.79
kritis      0.79
Schrö       0.77
autenti     0.77
Activations Density: 1.020%