INDEX
Explanations
phrases indicating a search for something specific or a desired outcome
phrases indicating ongoing searches or inquiries
New Auto-Interp
Negative Logits
Democr
-0.76
soever
-0.68
owned
-0.66
ANK
-0.65
minus
-0.64
racked
-0.63
NPR
-0.62
FactoryReloaded
-0.62
notwithstanding
-0.61
own
-0.61
POSITIVE LOGITS
ways
1.11
clues
1.10
answers
1.05
solutions
1.02
scapego
0.96
excuses
0.92
alternatives
0.92
explanations
0.91
opportunities
0.89
inspiration
0.87
Activations Density 0.076%