INDEX
Explanations
words related to exploiting legal or regulatory loopholes
references to legal loopholes and ways to exploit them
New Auto-Interp
Negative Logits
semble
-0.70
oran
-0.69
grave
-0.69
Kinnikuman
-0.68
haps
-0.66
yss
-0.66
pastoral
-0.65
Ķ
-0.63
ser
-0.61
image
-0.60
POSITIVE LOGITS
loopholes
1.55
loophole
1.46
glers
0.98
exemptions
0.93
deductions
0.84
witz
0.81
circumvent
0.80
ãĤµãĥ¼ãĥĨãĤ£ãĥ¯ãĥ³
0.80
cheat
0.75
backdoor
0.72
Activations Density 0.009%