INDEX
Explanations
adjectives and descriptive terms related to behavioral traits or characteristics
Negative Logits
tains     -0.71
ocular    -0.66
empt      -0.64
Clar      -0.63
ozo       -0.63
hon       -0.62
othe      -0.61
zyme      -0.61
odder     -0.61
ioxide    -0.61
Positive Logits
luster    0.89
nesses    0.82
owing     0.75
mson      0.75
miser     0.72
due       0.72
blacks    0.66
Worse     0.65
blight    0.65
Pradesh   0.65
Activations Density 0.165%