Explanations
sentences that end with a period
sentences that express conclusions or definitive statements
Negative Logits
metic      -0.75
uly        -0.68
iber       -0.67
thal       -0.67
footing    -0.66
onga       -0.65
purse      -0.65
loo        -0.64
tyr        -0.64
mosqu      -0.64
Positive Logits
However         0.97
Additionally    0.94
Likewise        0.90
Unfortunately   0.90
Therefore       0.90
Flavoring       0.88
They            0.87
Specifically    0.86
Moreover        0.86
Furthermore     0.86
Activations Density: 1.675%
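
The page does not say how the density figure is computed. A common reading, assumed here, is the fraction of sampled tokens on which the feature's activation is above zero. The sketch below illustrates that reading only; the function name, the zero threshold, and the toy data are all hypothetical and are not taken from this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of activations strictly above `threshold`.

    Assumption: "density" means the share of sampled tokens on which
    the feature fires at all. The dashboard may define it differently.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy sample: the feature fires on 1 of 8 tokens.
toy = [0.0, 0.0, 0.0, 2.3, 0.0, 0.0, 0.0, 0.0]
print(f"{activation_density(toy):.3%}")
```

Under this reading, a density of 1.675% would mean the feature is active on roughly 17 of every 1,000 sampled tokens.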