INDEX
Explanations
phrases related to political figures and discussions
references to specific organizations or entities, particularly in a political or policy context
New Auto-Interp
Negative Logits
DragonMagazine
-0.86
kefeller
-0.79
xit
-0.77
issance
-0.77
osaurus
-0.74
apore
-0.73
utenant
-0.70
theless
-0.70
pire
-0.69
icken
-0.68
POSITIVE LOGITS
Gamble
0.61
INTER
0.53
melan
0.53
Full
0.52
Roof
0.52
aspirin
0.51
Rew
0.51
puff
0.50
Blind
0.49
Enough
0.48
Activations Density 0.172%