Explanations
phrases indicating lack of effectiveness or limitations in guidelines and strategies
preceding a contrast or contradiction
but introduces contrast or exception
Negative Logits
пусть             -0.58
niech             -0.52
但不              -0.51
なら              -0.50
也算是            -0.49
λοι               -0.48
BagConstraints    -0.48
כן                -0.48
也可              -0.46
gärna             -0.46

Positive Logits
neither           1.01
なかなか          0.99
none              0.93
neither           0.90
Unless            0.90
Unless            0.89
unless            0.89
lacks             0.88
残念ながら        0.88
Neither           0.88
Activation Density: 0.590%