Explanations
the presence of specific phonetic patterns or sequences in words
Negative Logits
Token       Logit
kaç         -0.15
chooser     -0.15
ÄĻż         -0.14
uela        -0.14
atype       -0.14
itre        -0.14
liš         -0.14
rani        -0.14
losures     -0.13
webtoken    -0.13
Positive Logits
Token       Logit
edList      0.14
arpa        0.14
ACTER       0.13
ffer        0.13
dear        0.13
oire        0.13
standing    0.13
ë³µ         0.13
edian       0.13
standing    0.13
Activations Density: 0.295%