INDEX
Explanations
modal verbs indicating possibility or uncertainty
New Auto-Interp
Negative Logits
ailer
-0.18
raya
-0.15
istra
-0.15
irie
-0.14
opard
-0.14
pson
-0.14
_endian
-0.14
ifen
-0.13
plaisir
-0.13
Insecta
-0.13
POSITIVE LOGITS
onna
0.21
ones
0.20
be
0.20
hem
0.20
nard
0.19
saja
0.19
ily
0.17
/all
0.17
est
0.16
ÏĮÏģ
0.16
Activations Density 0.099%