INDEX
Explanations
positive declarations of readiness or willingness
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
397
+0.12
0.4%
481
+0.10
0.4%
204
+0.10
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
397
+0.12
0.02
411
+0.10
0.02
812
+0.10
0.02
Negative Logits
serai
-0.58
gubern
-0.54
panik
-0.53
akus
-0.52
maksi
-0.50
minimalis
-0.50
subjek
-0.49
simpel
-0.49
pels
-0.49
kooper
-0.48
POSITIVE LOGITS
ready
1.17
Ready
1.14
READY
1.10
Ready
1.10
ready
1.06
READY
1.02
isReady
0.83
readiness
0.80
Readiness
0.74
onReady
0.68
Activations Density 0.061%