INDEX
Explanations
phrases indicating specific points or facts
the conjunction "that" and its repeated emphasis in sentences
New Auto-Interp
Negative Logits
oses
-0.81
Tam
-0.66
cept
-0.65
zman
-0.65
Pont
-0.65
HO
-0.65
apsed
-0.62
agn
-0.62
MH
-0.62
eal
-0.61
POSITIVE LOGITS
although
0.94
"[
0.87
there
0.86
whereas
0.78
chery
0.78
whilst
0.77
unlike
0.76
they
0.72
while
0.71
despite
0.70
Activations Density 0.150%