INDEX
Negative Logits
eleph        -0.99
Þ            -0.94
pione        -0.88
exting       -0.86
aditional    -0.81
ò            -0.79
ß            -0.79
ñ            -0.79
enthusi      -0.78
anwhile      -0.77
Positive Logits
inx          0.54
't           0.50
igg          0.49
ette         0.48
athan        0.48
itely        0.46
--+          0.46
vals         0.46
ename        0.45
val          0.45
Activations Density: 0.662%