INDEX
Explanations
sentence endings or specific symbols that signal the end of a text
expressions of strong emotional or sensory experiences
New Auto-Interp
Negative Logits
xual
-0.84
scrap
-0.79
onte
-0.79
Blackwell
-0.73
gard
-0.71
allied
-0.69
friendly
-0.67
interested
-0.67
hes
-0.66
ional
-0.65
POSITIVE LOGITS
Reason
1.01
Section
1.01
?????-?????-
0.99
Unless
0.96
Added
0.93
Writing
0.93
Comment
0.91
Most
0.90
Since
0.88
advertising
0.88
Activations Density 0.214%