INDEX
Explanations
usernames or handles with 'ub' in them
occurrences of the substring "ub" in different contexts
New Auto-Interp
Negative Logits
Lauder
-0.69
drift
-0.69
ORIG
-0.66
Atlantic
-0.66
Clash
-0.64
ultraviolet
-0.62
ozone
-0.62
butterflies
-0.61
Irma
-0.60
Overt
-0.60
POSITIVE LOGITS
lishing
1.36
ilee
1.27
ilant
1.24
lique
1.22
rious
1.21
bing
1.17
bish
1.12
lisher
1.11
lish
1.11
bles
1.08
Activations Density 0.030%