Explanations
instances of conditional phrases and their implications
Negative Logits
evi       -0.15
елем      -0.14
063       -0.14
assel     -0.14
angan     -0.13
ijing     -0.13
858       -0.13
ermal     -0.13
atik      -0.13
elpers    -0.12
Positive Logits
anyway          1.09
Anyway          0.99
Anyway          0.97
anyways         0.93
anyhow          0.78
nonetheless     0.54
nevertheless    0.51
toch            0.47
Nevertheless    0.47
Nonetheless     0.46
Activations Density 1.155%