INDEX
Explanations
punctuation marks signaling a contrast or transition in a text
instances of the word "However."
Negative Logits
Token         Logit
ULAR          -0.74
SourceFile    -0.70
pecially      -0.67
ni            -0.65
ige           -0.62
SI            -0.62
pack          -0.61
grand         -0.61
arez          -0.60
rolled        -0.60
Positive Logits
Token           Logit
unlike          1.06
alas            1.05
beware          0.94
chery           0.88
according       0.86
insofar         0.85
interestingly   0.82
despite         0.81
there           0.80
nevertheless    0.79
Activations Density: 0.085%