INDEX
Explanations
phrases related to quantity or degree
references to specific quantities, types, or characteristics related to objects and situations
New Auto-Interp
Negative Logits
perse
-0.67
kidding
-0.65
heid
-0.62
unden
-0.62
vain
-0.60
imming
-0.59
Yourself
-0.57
Debor
-0.56
BuyableInstoreAndOnline
-0.56
Tro
-0.55
POSITIVE LOGITS
thresholds
0.75
Ore
0.75
olson
0.70
threshold
0.69
Keefe
0.66
certain
0.64
odox
0.62
catentry
0.62
ASON
0.62
undefined
0.61
Activations Density 0.150%