INDEX
Explanations
phrases related to preferences or selections
words related to preferences and conditions affecting individuals or groups
New Auto-Interp
Negative Logits
osaurs
-0.91
OPE
-0.79
izable
-0.78
apy
-0.71
ozo
-0.70
GOODMAN
-0.69
ãĥīãĥ©ãĤ´ãĥ³
-0.69
Accessory
-0.69
Reviewer
-0.69
IZE
-0.69
POSITIVE LOGITS
erent
1.16
eren
1.00
lot
0.90
erences
0.88
ixed
0.86
elt
0.85
fw
0.82
liction
0.79
illed
0.77
raid
0.76
Activations Density 0.014%