INDEX
Explanations
the definite article "the" in various contexts
Negative Logits
zig      -0.17
rzy      -0.16
irie     -0.15
IRCLE    -0.14
lea      -0.14
rych     -0.14
話       -0.14
radu     -0.14
iona     -0.13
ensem    -0.13
POSITIVE LOGITS
standpoint      0.32
perspective     0.31
outset          0.25
oth             0.22
perspectives    0.22
beginning       0.21
Perspective     0.21
/to             0.20
umber           0.20
side            0.19
Activations Density 0.082%