INDEX
Explanations
words related to sensitive or confidential information
terms related to secrecy and serious incidents
Negative Logits
̶            -0.88
ecause       -0.84
ocrates      -0.81
olkien       -0.77
Ô            -0.74
[_           -0.73
cius         -0.71
akespeare    -0.71
olean        -0.70
ventus       -0.68
Positive Logits
ordeal       1.14
incident     1.11
arrangement  1.06
portion      1.06
event        1.03
endeavor     1.03
affair       1.02
probe        1.02
operation    1.01
edition      1.00
Activations Density: 0.419%