INDEX
Explanations
words related to physical health issues, especially nosebleeds, and corporate terms
words that relate to the concept of being "able" or "capable."
Negative Logits
alli          -0.83
arro          -0.78
irtual        -0.72
okin          -0.71
ellar         -0.69
ERA           -0.69
Palestin      -0.68
VERTISEMENT   -0.67
ors           -0.66
erva          -0.65
Positive Logits
bles          1.13
theless       1.09
tt            0.95
grass         0.94
bled          0.93
leaf          0.92
vous          0.92
cht           0.90
bling         0.90
heads         0.87
Activations Density: 0.043%