INDEX
Explanations
causal relationships or implications between different elements of a narrative or discourse
instances of punctuation, specifically commas used in complex or lengthy sentences
New Auto-Interp
Negative Logits
ety
-0.66
¬¼
-0.64
UF
-0.60
enary
-0.54
herent
-0.54
§
-0.53
pec
-0.53
iple
-0.52
orn
-0.50
reth
-0.50
POSITIVE LOGITS
albeit
1.08
although
0.93
namely
0.89
whereas
0.85
however
0.84
though
0.84
regardless
0.81
which
0.79
but
0.79
huh
0.79
Activations Density 1.462%