INDEX
Explanations
terms related to legal restrictions and prohibitions
New Auto-Interp
Negative Logits
bard
-0.77
Lenin
-0.70
Kings
-0.69
tone
-0.69
osterone
-0.68
ser
-0.66
Bee
-0.66
arse
-0.65
etics
-0.65
å°Ĩ
-0.65
POSITIVE LOGITS
anyone
0.99
interfering
0.92
tampering
0.91
anybody
0.88
discrimination
0.88
any
0.86
accessing
0.84
¿½
0.83
undue
0.82
disclosing
0.80
Activations Density 0.037%