INDEX
Explanations
phrases related to government policies and actions
phrases that include the word "of" followed by contextual mentions relating to various social or political systems
New Auto-Interp
Negative Logits
worthy
-0.75
orthy
-0.75
acious
-0.74
reperto
-0.69
Extras
-0.68
unit
-0.66
oor
-0.64
èĢħ
-0.64
paces
-0.63
bucks
-0.63
POSITIVE LOGITS
ours
0.89
separation
0.88
treating
0.85
theirs
0.84
separating
0.84
distributing
0.83
relying
0.82
avoiding
0.82
hers
0.82
electing
0.82
Activations Density 0.145%