INDEX
Explanations
phrases related to strategies and political discussions
New Auto-Interp
Negative Logits
apses
-0.69
assador
-0.67
uable
-0.67
esters
-0.66
ocaust
-0.65
essee
-0.64
agues
-0.64
icious
-0.64
adelphia
-0.64
atoon
-0.64
POSITIVE LOGITS
mantra
1.28
stance
1.27
principles
1.26
tenets
1.17
principle
1.17
motto
1.16
strategy
1.11
ideals
1.07
stances
1.06
regimen
1.06
Activations Density 0.146%