INDEX
Explanations
the definite article "the"
Followed by many different nouns
New Auto-Interp
Negative Logits
morire
-0.78
+#+#
-0.78
oprot
-0.76
décid
-0.73
Roskov
-0.72
nemico
-0.70
Chwiliwch
-0.69
LookAnd
-0.69
démocr
-0.68
soñ
-0.67
POSITIVE LOGITS
the
0.67
former
0.66
entire
0.59
principal
0.59
formik
0.57
organization
0.56
enthe
0.56
system
0.55
previous
0.54
respective
0.54
Activations Density 0.405%