INDEX
Explanations
mentions of the word "itch"
occurrences of the word "bitch."
New Auto-Interp
Negative Logits
Ducks
-0.73
ragon
-0.71
©¶æ
-0.70
isman
-0.70
untu
-0.69
anguage
-0.68
vested
-0.67
senal
-0.66
¥ŀ
-0.65
zac
-0.64
POSITIVE LOGITS
itch
1.05
imaru
1.04
itching
0.91
ITCH
0.84
fork
0.83
icago
0.82
ieri
0.80
Pitch
0.76
roll
0.75
itched
0.74
Activations Density 0.010%