INDEX
Explanations
phrases related to quality and concern
New Auto-Interp
Negative Logits
idav
-0.92
plets
-0.77
MAP
-0.77
etsk
-0.71
skirts
-0.70
adan
-0.66
pes
-0.65
SN
-0.63
Shut
-0.63
rises
-0.60
POSITIVE LOGITS
importance
1.26
magnitude
1.23
stature
1.19
proportions
1.14
caliber
1.13
origin
1.10
renown
1.08
relevance
1.05
size
1.05
calib
1.03
Activations Density 0.159%