INDEX
Explanations
references to legal matters or regulations
phrases related to legal status or legality
New Auto-Interp
Negative Logits
Seasons
-0.84
mble
-0.72
pread
-0.70
Tycoon
-0.69
Tate
-0.68
ritch
-0.68
Clip
-0.67
Wing
-0.67
Roses
-0.66
Velocity
-0.66
POSITIVE LOGITS
enforce
0.91
obliged
0.83
sanctioned
0.81
obligated
0.80
mandated
0.78
speaking
0.76
compliant
0.76
wedd
0.74
exting
0.74
inition
0.74
Activations Density 0.026%