INDEX
Explanations
statements related to government responsibility and potential consequences
New Auto-Interp
Negative Logits
colored
-1.10
defense
-1.03
avored
-0.96
favors
-0.82
rollment
-0.82
Defense
-0.80
caliber
-0.80
imize
-0.80
avior
-0.78
Shiite
-0.77
POSITIVE LOGITS
recognise
1.65
realise
1.61
recognised
1.50
apologise
1.40
Whilst
1.37
Ukip
1.35
paed
1.34
behaviours
1.34
whilst
1.34
recogn
1.32
Activations Density 1.792%