INDEX
Explanations
words related to options, choices, variations, and different configurations
references to variations or differences in context or characteristics
New Auto-Interp
Negative Logits
aught
-0.79
ulner
-0.76
========
-0.71
average
-0.64
elfare
-0.60
+++
-0.60
newcomers
-0.60
olicited
-0.59
apego
-0.59
raised
-0.57
POSITIVE LOGITS
places
1.34
directions
1.32
contexts
1.31
locations
1.25
direction
1.19
guise
1.18
manner
1.18
vicinity
1.15
fashion
1.10
manners
1.10
Activations Density 0.185%