INDEX
Explanations
sentences ending with a full stop
punctuations, specifically periods at the end of statements
New Auto-Interp
Negative Logits
uly
-0.87
iber
-0.74
thal
-0.71
arov
-0.71
opponent
-0.71
ãĤ§
-0.71
helper
-0.69
objective
-0.68
organis
-0.68
content
-0.65
POSITIVE LOGITS
Worse
1.04
But
1.02
Eventually
0.98
Their
0.97
And
0.96
They
0.96
Luckily
0.96
Whatever
0.94
Amid
0.94
Suddenly
0.94
Activations Density 1.179%