INDEX
Explanations
phrases indicating continued actions or sequences
phrases related to continuous actions or processes
Negative Logits
Surprise    -0.86
Gleaming    -0.76
Mesa        -0.73
Yosemite    -0.72
Shelter     -0.72
Runner      -0.70
Abdel       -0.70
dylib       -0.68
Runner      -0.64
Clothing    -0.63
Positive Logits
uninterrupted    0.81
adolesc          0.77
tions            0.76
clusively        0.75
abre             0.74
edly             0.73
unab             0.69
indefinitely     0.69
diligence        0.68
nces             0.67
Activations Density: 0.238%