INDEX
Explanations
words related to rules, instructions, or commands
mentions of directives or regulations
Negative Logits
Alto      -0.79
Argon     -0.77
ITE       -0.77
ock       -0.76
Blacks    -0.74
van       -0.74
Gordon    -0.69
OWS       -0.69
Sack      -0.68
Pens      -0.68
Positive Logits
directives     1.24
directive      1.17
confir         0.93
Directive      0.88
guiActiveUn    0.87
nod            0.85
decree         0.84
ordering       0.83
nomine         0.83
directs        0.81
Activations Density: 0.007%