NEGATIVE LOGITS
Token            Logit
disambiguazione  -1.00
bill             -0.91
ujednoznacz      -0.75
bill             -0.65
meal             -0.64
:✨               -0.59
Bill             -0.58
Espèce           -0.56
WEBPACK          -0.55
Meal             -0.55
POSITIVE LOGITS
Token            Logit
pleaſure         0.77
leſs             0.70
Shakspeare       0.69
age              0.68
able             0.68
myſelf           0.66
itſelf           0.65
houſe            0.64
ſmall            0.64
faſt             0.63
Activations Density: 0.092%