INDEX
Explanations
sentences ending with a full stop
sentences and punctuation marks
New Auto-Interp
Negative Logits
thous
-0.62
oun
-0.61
ãĤµãĥ¼ãĥĨãĤ£ãĥ¯ãĥ³
-0.61
newcom
-0.58
pse
-0.57
tremend
-0.57
untarily
-0.55
metic
-0.54
£ı
-0.53
ŃĶ
-0.53
POSITIVE LOGITS
↵
1.55
<|endoftext|>
1.31
↵↵
0.93
ðŁĻĤ
0.89
®
0.83
SPONSORED
0.78
Including
0.77
Especially
0.72
ðŁĺ
0.68
;)
0.66
Activations Density 0.600%