INDEX
Explanations
references to political positions, particularly related to governmental roles or controversies
New Auto-Interp
Negative Logits
ISM
-0.77
Lions
-0.76
Abyssal
-0.71
Salvation
-0.71
brackets
-0.69
Worlds
-0.67
STAR
-0.66
dots
-0.66
ritch
-0.66
Italians
-0.66
POSITIVE LOGITS
chool
1.31
cription
1.29
ervation
1.21
umed
1.20
erves
1.19
byter
1.17
ervative
1.15
erving
1.11
idency
1.11
erver
1.10
Activations Density 6.676%