INDEX
Explanations
references to the word "gum."
mentions of "gum" and related forms or phrases
New Auto-Interp
Negative Logits
ourge
-0.80
Serbia
-0.74
essen
-0.73
breast
-0.67
asters
-0.63
Sard
-0.63
stanbul
-0.63
æ©Ł
-0.63
isd
-0.62
KN
-0.62
POSITIVE LOGITS
ammed
2.34
gum
1.87
lly
1.79
Gum
1.73
Nou
1.01
gow
1.01
Boyd
0.99
oru
0.97
hol
0.96
oho
0.95
Activations Density 0.050%