INDEX
Explanations
Occurrences of the word "the" in various contexts
Tokens that follow "the" or a possessive pronoun
The word "the" followed by a specific term
Negative Logits
Token      Logit
,          -0.43
(blank)    -0.41
I          -0.35
in         -0.35
.          -0.35
he         -0.34
There      -0.34
short      -0.34
I          -0.34
keinem     -0.33
Positive Logits
Token              Logit
Administrativna    0.81
ſſung              0.81
препратки          0.74
ویکیپدیا           0.73
<unused52>         0.72
<unused43>         0.72
<unused74>         0.72
<unused79>         0.72
<unused16>         0.72
<unused42>         0.72
Activations Density 0.432%