INDEX
Explanations
mentions of the word "rude" in various contexts
the word "dude" and its variations in different contexts
Negative Logits
ãĥ£     -0.92
ILCS    -0.89
awaru   -0.88
riors   -0.81
marked  -0.79
pora    -0.79
stall   -0.77
olulu   -0.76
iott    -0.75
apesh   -0.75
Positive Logits
gger    0.77
anu     0.76
xual    0.74
uces    0.74
cker    0.72
mber    0.69
icone   0.69
lder    0.68
geries  0.67
Olson   0.67
Activations Density 0.028%