INDEX
Explanations
phrases related to personal responsibility and consequences
New Auto-Interp
Negative Logits
currently
-0.88
interstitial
-0.77
eeper
-0.72
urry
-0.66
FIG
-0.66
aldi
-0.64
SPONSORED
-0.63
urrent
-0.62
current
-0.62
odge
-0.61
POSITIVE LOGITS
yesterday
0.96
wrong
0.88
terday
0.84
ago
0.84
nob
0.81
incompet
0.77
Watergate
0.76
countless
0.74
fools
0.74
greatness
0.73
Activations Density 0.757%