Explanations
comparative phrases that imply contrasts or conditions
Negative Logits
Either          -0.91
Either          -0.88
invece          -0.85
UnusedPrivate   -0.81
更是            -0.79
either          -0.78
malah           -0.78
either          -0.78
inoltre         -0.78
dessutom        -0.77
Positive Logits
technically     1.44
nominally       1.28
initially       1.23
admittedly      1.19
ostensibly      1.14
outwardly       1.10
téc             1.05
superfic        1.04
theoretically   1.04
undoubtedly     0.96
Activations Density 0.600%