INDEX
Explanations
phrases indicating a change or transition
phrases indicating a change or recent developments
Negative Logits
Token        Logit
umbn         -0.68
lez          -0.65
Ops          -0.63
vec          -0.63
Springfield  -0.61
yrics        -0.61
comp         -0.60
Apostle      -0.58
EntityItem   -0.58
``           -0.58
Positive Logits
Token        Logit
DEN          0.82
unsus        0.74
unnoticed    0.73
belonged     0.68
dating       0.67
ione         0.65
unknown      0.65
í            0.64
inction      0.64
hesitated    0.64
Activations Density: 0.066%