INDEX
Explanations
words related to various strong beliefs or ideologies
terms associated with various ideological groups and beliefs
New Auto-Interp
Negative Logits
Delivery
-0.77
increments
-0.75
Owner
-0.64
shown
-0.61
adium
-0.60
PROG
-0.60
Performance
-0.59
bilateral
-0.59
Sau
-0.59
ãĥ¼ãĥ³
-0.57
POSITIVE LOGITS
paces
1.36
ervatives
1.36
ervative
1.31
hip
1.17
hips
1.11
rejoice
1.08
pace
1.05
cale
1.01
alike
0.98
peak
0.98
Activations Density 0.198%