INDEX
Explanations
words related to political events and actions
New Auto-Interp
Negative Logits
neath
-0.81
maxwell
-0.77
lux
-0.77
tick
-0.74
̶
-0.73
aclysm
-0.72
isible
-0.71
agen
-0.70
ÃįÃį
-0.70
unique
-0.70
POSITIVE LOGITS
vowed
1.11
urged
1.07
thereby
1.03
insisted
1.03
undertook
1.02
withdrew
1.02
demanded
1.01
oversaw
0.98
apologized
0.95
instituted
0.95
Activations Density 0.285%