INDEX
Explanations
phrases related to protection and defense
references to protection, security, and safeguarding of individuals or groups
New Auto-Interp
Negative Logits
onwards
-0.74
sonian
-0.73
NEWS
-0.64
Likely
-0.63
Difficulty
-0.62
reckoning
-0.59
psy
-0.58
Cart
-0.57
cart
-0.57
Trouble
-0.56
POSITIVE LOGITS
integrity
0.94
egu
0.92
yrights
0.91
confidentiality
0.91
secrets
0.87
riott
0.86
dignity
0.84
tnc
0.82
privacy
0.78
endangered
0.77
Activations Density 0.198%