INDEX
Explanations
instances of the word "but" and variations of contrastive conjunctions
New Auto-Interp
Negative Logits
astify
-0.56
сылкі
-0.55
dyž
-0.54
Obrador
-0.50
cticut
-0.50
still
-0.48
useAppContext
-0.48
onBackPressed
-0.48
Still
-0.47
Picchu
-0.45
POSITIVE LOGITS
increasingly
0.65
artık
0.56
ormai
0.49
mittlerweile
0.49
progressively
0.47
coraz
0.46
不再
0.45
gradually
0.45
zuneh
0.42
inzwischen
0.40
Activations Density 0.048%