INDEX
Explanations
mentions of specific individuals and their actions in a narrative context
New Auto-Interp
Negative Logits
currently
-0.17
Äijang
-0.16
currently
-0.14
abbo
-0.14
Currently
-0.14
chw
-0.14
already
-0.14
缮åīį
-0.13
tomorrow
-0.13
ervations
-0.13
POSITIVE LOGITS
sometimes
0.27
variably
0.26
often
0.25
sometimes
0.25
whenever
0.24
always
0.24
often
0.24
always
0.24
occasionally
0.22
vždy
0.22
Activations Density 0.276%