INDEX
Explanations
phrases or words related to authority figures
mentions of legislative or legal terminology related to rules or structures
Negative Logits
Token       Logit
Drawn       -0.76
Mirage      -0.75
mounts      -0.72
passions    -0.71
avenues     -0.71
Sammy       -0.71
intu        -0.71
counsel     -0.70
roots       -0.70
sources     -0.69
Positive Logits
Token        Logit
uthor        1.13
reci         1.05
vernment     1.04
£            1.01
amily        1.00
very         0.97
little       0.95
actual       0.95
significant  0.95
swer         0.94
Activations Density: 0.081%