INDEX
Explanations
phrases related to changes, developments, or conditions that are noteworthy or indicative of a specific state
phrases that refer to signs or indications of various conditions or situations
Negative Logits
Token       Logit
quet        -0.80
eah         -0.74
chens       -0.74
ials        -0.73
nets        -0.72
adelphia    -0.71
omers       -0.71
apest       -0.69
ourses      -0.69
iband       -0.69
Positive Logits
Token              Logit
decay              0.88
bias               0.77
fatigue            0.77
aggression         0.74
occupancy          0.74
warmth             0.74
distress           0.73
desperation        0.72
progress           0.71
differentiation    0.71
Activations Density: 0.056%