INDEX
Negative Logits
ahime
-0.77
IJ
-0.76
rient
-0.72
gnu
-0.71
rients
-0.70
cv
-0.69
acle
-0.69
ãĤ¹ãĥĪ
-0.68
Democr
-0.64
isol
-0.63
POSITIVE LOGITS
farewell
0.96
goodbye
0.82
ewater
0.73
auctions
0.72
erville
0.69
whale
0.68
warm
0.67
selves
0.66
auction
0.66
aloud
0.65
Activations Density 0.198%