INDEX
Explanations
the word "rude" with varying levels of intensity
instances of the word "dude" and its variations
New Auto-Interp
Negative Logits
ILCS
-0.95
ãĥ£
-0.82
marked
-0.80
stall
-0.77
Interstitial
-0.76
pora
-0.75
apesh
-0.74
olulu
-0.73
keep
-0.72
erald
-0.72
POSITIVE LOGITS
ude
0.77
lder
0.75
udes
0.75
xual
0.73
vich
0.69
geries
0.69
uces
0.68
cker
0.68
anu
0.68
geist
0.67
Activations Density 0.018%