INDEX
Explanations
paragraphs expressing personal reflections and emotional intensity
Negative Logits
Token       Logit
days        -0.71
Digest      -0.71
etheless    -0.69
lus         -0.63
eming       -0.62
Provides    -0.58
Coverage    -0.56
Received    -0.55
ept         -0.54
arel        -0.54
Positive Logits
Token       Logit
aside       1.02
emphasis    0.85
pedal       0.83
brakes      0.83
jeopardy    0.83
together    0.81
lid         0.81
toget       0.80
spotlight   0.78
rescent     0.76
Activations Density: 0.125%