INDEX
Explanations
mentions of specific numbers or rankings
the definite article "the" in various contexts
New Auto-Interp
Negative Logits
âĢİ
-0.73
leeve
-0.67
Joined
-0.66
Akin
-0.62
Fiona
-0.62
fulness
-0.60
èª
-0.60
Gohan
-0.59
chu
-0.59
imaru
-0.58
POSITIVE LOGITS
aforementioned
0.97
latter
0.95
heaviest
0.92
same
0.87
smallest
0.84
largest
0.81
proverbial
0.81
highest
0.79
utmost
0.79
worst
0.79
Activations Density 0.360%