INDEX
Negative Logits
kasarigan
-1.09
فريبيس
-0.94
reaſon
-0.90
itſelf
-0.90
uſe
-0.89
themſelves
-0.87
Theſe
-0.87
purpoſe
-0.86
uſed
-0.86
pleaſure
-0.86
POSITIVE LOGITS
y
0.81
e
0.77
Charlie
0.73
Charlie
0.67
Lang
0.66
&
0.65
um
0.65
McIntyre
0.63
AssignableFrom
0.63
tem
0.62
Activations Density 0.017%