INDEX
Explanations
sentences indicating a continuation or progression of a situation or action
phrases indicating continuation or ongoing actions
New Auto-Interp
Negative Logits
Surprise
-0.94
Skin
-0.75
Yosemite
-0.71
Abdel
-0.70
Ha
-0.69
Mesa
-0.68
Runner
-0.68
Jaw
-0.67
Ans
-0.66
Wrap
-0.65
POSITIVE LOGITS
tions
0.89
abre
0.83
adolesc
0.82
clusively
0.80
:]
0.76
nces
0.75
ussions
0.73
orks
0.73
azeera
0.72
rily
0.72
Activations Density 0.182%