INDEX
Explanations
conjunctions followed by a contrasting statement
the conjunction "But" in various contexts
New Auto-Interp
Negative Logits
heads
-0.76
segment
-0.72
pier
-0.63
cloth
-0.63
obyl
-0.59
¯¯¯¯
-0.58
sense
-0.58
ceremony
-0.58
srfN
-0.57
ãģ®
-0.57
POSITIVE LOGITS
tons
1.24
alas
0.95
romeda
0.83
theless
0.82
chers
0.80
luckily
0.77
ts
0.77
LER
0.74
chery
0.73
hey
0.72
Activations Density 0.100%