INDEX
Explanations
modal verbs indicating possibility or probability
New Auto-Interp
Negative Logits
istrovstvÃŃ
-0.16
inia
-0.16
istra
-0.16
oster
-0.15
aterno
-0.15
isoft
-0.15
isas
-0.14
ailer
-0.14
ropp
-0.14
.scalablytyped
-0.14
POSITIVE LOGITS
be
0.28
onna
0.21
hem
0.20
ily
0.20
ors
0.19
oral
0.18
-have
0.17
být
0.17
nard
0.17
تÙĥÙĪÙĨ
0.17
Activations Density 0.097%