INDEX
Explanations
punctuation marks at the end of sentences
New Auto-Interp
Negative Logits
ļéĨĴ
-0.73
interstitial
-0.68
SHIP
-0.64
robe
-0.62
haul
-0.61
ername
-0.58
dragon
-0.58
olves
-0.55
ufact
-0.55
WAYS
-0.55
POSITIVE LOGITS
however
1.27
though
1.07
huh
1.04
moreover
1.00
sir
0.96
alas
0.95
incidentally
0.88
eh
0.87
anyway
0.87
albeit
0.85
Activations Density 0.366%