INDEX
Explanations
instances of the word "but" emphasizing contrast or transition in discussions
New Auto-Interp
Negative Logits
ÄĻż
-0.15
icer
-0.14
maar
-0.14
ä½Ĩæĺ¯
-0.14
ouver
-0.14
yet
-0.14
agment
-0.14
phant
-0.13
transparent
-0.13
oor
-0.13
POSITIVE LOGITS
cher
0.17
tery
0.16
ystack
0.16
despite
0.16
tk
0.15
chie
0.15
tern
0.14
Appe
0.14
lauf
0.14
auer
0.14
Activations Density 0.087%