Explanations
phrases related to patriotism and national pride
Negative Logits
ÄŁ                  -0.68
odcast              -0.66
ptions              -0.66
McMaster            -0.64
turnover            -0.62
(token not shown)   -0.62
basic               -0.62
decentral           -0.62
CMS                 -0.60
extr                -0.60
Positive Logits
And     0.90
Where   0.88
Unt     0.86
Shall   0.82
Which   0.82
Were    0.81
ORN     0.81
Cause   0.81
Who     0.79
cause   0.78
Activations Density 0.121%