INDEX
Explanations
instances of the word "as" and its variations within the text
New Auto-Interp
Negative Logits
fcn
-0.20
_ENC
-0.17
etter
-0.16
following
-0.16
ãĥ©ãĥ¼
-0.16
eldre
-0.15
evidenced
-0.14
rawer
-0.14
otti
-0.14
hci
-0.14
POSITIVE LOGITS
with
0.26
such
0.26
always
0.23
luck
0.22
mentioned
0.21
pects
0.20
noted
0.20
importantly
0.20
ynchronous
0.19
previously
0.19
Activations Density 0.068%