INDEX
Explanations
phrases ending in a particular punctuation mark
punctuation marks, specifically periods indicating the end of sentences
Negative Logits
Token      Logit
metic      -0.66
anmar      -0.58
IST        -0.54
IAS        -0.52
opter      -0.52
artif      -0.52
Kyoto      -0.51
NYT        -0.51
FB         -0.51
Canaver    -0.50
Positive Logits
Token            Logit
↵                0.97
SPONSORED        0.82
<|endoftext|>    0.76
gard             0.72
↵↵               0.66
ppard            0.66
(blank)          0.57
Lt               0.55
ii               0.53
(*               0.53
Activations Density: 0.182%