INDEX
Explanations
references to civil conflict or war
New Auto-Interp
Negative Logits
avorite
-0.78
Tree
-0.75
ournal
-0.74
Transcript
-0.70
raviolet
-0.65
Dominion
-0.63
Whale
-0.62
Sigma
-0.62
HK
-0.61
Wicked
-0.60
POSITIVE LOGITS
izational
1.40
isations
1.16
ised
1.03
iza
1.00
ization
1.00
isation
1.00
iz
0.97
unrest
0.91
izing
0.91
izations
0.90
Activations Density 0.010%