INDEX
Explanations
phrases expressing contrast or impossibility
negation and phrases indicating isolation or lack
New Auto-Interp
Negative Logits
aim
-0.79
ELD
-0.78
aku
-0.74
iple
-0.73
file
-0.73
alien
-0.70
ilon
-0.70
mitter
-0.68
arten
-0.68
{"-0.66
POSITIVE LOGITS
acular
0.85
necessarily
0.76
lihood
0.76
anything
0.76
outright
0.74
Marketable
0.67
possibly
0.66
å§«
0.65
surpass
0.64
any
0.64
Activations Density 0.015%