INDEX
Explanations
terms and phrases related to corruption and bribery in governmental contexts
New Auto-Interp
Negative Logits
uckle
-0.16
logan
-0.16
CHAT
-0.15
icaret
-0.15
nore
-0.15
Stripe
-0.15
ikit
-0.15
chers
-0.15
Billing
-0.14
بÙĦ
-0.14
POSITIVE LOGITS
bri
0.32
corruption
0.30
corrupt
0.29
brib
0.29
bribery
0.28
favors
0.28
gifts
0.27
influence
0.25
corrupted
0.25
fav
0.24
Activations Density 0.146%