INDEX
Explanations
occurrences of syllables and phonetic patterns resembling "swo," "lo," and "bo."
New Auto-Interp
Negative Logits
Tham
-0.18
rots
-0.17
mates
-0.16
rian
-0.15
ness
-0.15
ru
-0.15
াà¦
-0.15
bib
-0.15
erd
-0.15
introductory
-0.15
POSITIVE LOGITS
oking
0.19
ogle
0.19
cket
0.18
oling
0.18
xford
0.18
oop
0.18
yers
0.18
opers
0.18
oper
0.18
iler
0.17
Activations Density 0.089%