INDEX
Explanations
terms related to official or confidential positions within organizations
terms related to secretive or classified information
Negative Logits
neglect      -0.64
ibr          -0.61
assert       -0.61
Feldman      -0.60
negligent    -0.60
ICT          -0.60
pron         -0.59
Paw          -0.59
bats         -0.58
impaired     -0.58
Positive Logits
artment      0.98
artments     0.97
ary          0.88
lain         0.87
aries        0.83
arily        0.82
compartment  0.79
roleum       0.79
oslov        0.79
arity        0.78
Activations Density 0.076%