INDEX
Explanations
sentences emphasizing immediate actions or events
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
405
+0.13
0.4%
1265
+0.13
0.4%
1272
+0.11
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
405
+0.13
0.03
1272
+0.13
0.03
1222
+0.11
0.03
Negative Logits
guma
-0.79
pinak
-0.62
tanong
-0.61
loob
-0.56
kanya
-0.54
walang
-0.54
pama
-0.53
maraming
-0.51
essentiels
-0.50
ibang
-0.50
POSITIVE LOGITS
immediate
0.81
immediately
0.79
Immediate
0.78
pamph
0.77
Immediately
0.75
IMMEDIATE
0.74
Immediately
0.71
immedi
0.71
dramatist
0.70
immediately
0.69
Activations Density 0.101%