INDEX
Explanations
references to domestic activities and daily routines
references to personal possessions or ownership
Negative Logits
hots          -0.79
nir           -0.78
thood         -0.74
upload        -0.71
isan          -0.71
namely        -0.71
Ultimately    -0.70
allowing      -0.69
rame          -0.69
etheless      -0.69
Positive Logits
occasional    1.25
slightest     1.23
coolest       1.15
hottest       1.12
Kardash       1.11
nearest       1.10
proverbial    1.07
dreaded       1.06
smallest      1.05
fridge        1.04
Activations Density: 0.355%