INDEX
Explanations
phrases related to quotes or statements being said
periods at the end of statements
Negative Logits
Token           Logit
skirm           -0.74
ensibly         -0.68
spoiled         -0.67
assassin        -0.66
silly           -0.66
slightest       -0.65
toy             -0.64
midrange        -0.64
unlucky         -0.64
transact        -0.64
Positive Logits
Token           Logit
Refer           0.96
↵↵              0.89
Elsewhere       0.85
[+              0.84
Another         0.83
Earlier         0.82
org             0.82
Accessed        0.81
Meanwhile       0.81
<|endoftext|>   0.80
Activations Density: 0.210%