INDEX
Explanations
references to the mouth and related activities
New Auto-Interp
Negative Logits
º«
-0.17
impse
-0.16
hea
-0.15
æ³Ĭ
-0.15
ilt
-0.14
ildo
-0.14
ÎŃÏģ
-0.14
hya
-0.14
å¸Ń
-0.14
ATRIX
-0.14
POSITIVE LOGITS
ful
0.31
piece
0.31
wash
0.27
pieces
0.24
water
0.24
FUL
0.23
cavity
0.23
-water
0.23
feel
0.22
parts
0.21
Activations Density 0.015%