INDEX
Negative Logits
$_"
-1.02
Efq
-1.02
itſelf
-1.01
myſelf
-0.95
Diſ
-0.94
estekak
-0.93
Wib
-0.92
'])){
-0.91
-------------</
-0.91
raiſ
-0.91
POSITIVE LOGITS
shall
0.91
shall
0.86
Shall
0.78
SHALL
0.73
Shall
0.69
hath
0.69
Hansen
0.68
phors
0.65
momile
0.63
secas
0.61
Activations Density 0.005%