INDEX
Explanations
references to political or legal mandates
references to legal mandates or mandatory policies
Negative Logits
Token     Logit
istics    -1.03
folk      -0.91
stocks    -0.88
hop       -0.87
ophon     -0.84
ienne     -0.82
Hop       -0.78
can       -0.77
cases     -0.77
ihad      -0.76
Positive Logits
Token        Logit
compliance   0.79
enforced     0.76
creep        0.76
decree       0.75
mandate      0.75
conformity   0.74
mand         0.73
obedience    0.72
imposed      0.72
breaker      0.71
Activations Density: 0.077%