INDEX
Explanations
politically-oriented words related to different ideologies
references to various political and ideological groups
Negative Logits
Delivery            -0.79
Owner               -0.74
STA                 -0.65
INAL                -0.64
wise                -0.62
geological          -0.61
inventoryQuantity   -0.61
circumstances       -0.60
CHECK               -0.60
Bulldogs            -0.60
Positive Logits
ervatives           1.44
ervative            1.39
aurus               1.28
paces               1.25
pace                0.98
rejoice             0.97
hip                 0.96
chool               0.96
mith                0.96
'                   0.94
Activations Density: 0.098%