INDEX
Explanations
commas and the phrase "In fact"
instances of the phrase "In fact."
New Auto-Interp
Negative Logits
olves
-0.78
aled
-0.67
adr
-0.66
owed
-0.64
odon
-0.63
gment
-0.62
igator
-0.61
mage
-0.61
cup
-0.61
onym
-0.60
POSITIVE LOGITS
contrary
0.93
although
0.88
according
0.84
though
0.81
there
0.79
insofar
0.78
it
0.76
despite
0.76
if
0.76
unlike
0.75
Activations Density 0.085%