INDEX
Explanations
mentions of geopolitical entities and political figures
terms related to political and governmental entities
New Auto-Interp
Negative Logits
Ô
-0.70
thood
-0.69
thia
-0.67
chwitz
-0.64
ombat
-0.64
ĸļ
-0.64
xual
-0.63
cycles
-0.61
hops
-0.61
atown
-0.60
POSITIVE LOGITS
bureau
0.94
agency
0.83
equivalent
0.81
secretary
0.75
industry
0.72
watchdog
0.71
version
0.71
department
0.71
interpreter
0.71
playbook
0.71
Activations Density 0.621%