Explanations
transitional phrases indicating contrast or exceptions in arguments
Negative Logits
QB          -0.75
ãĤ©         -0.74
äºĶ         -0.66
shortest    -0.66
cooldown    -0.64
çľ          -0.62
Nation      -0.61
é¾          -0.61
beginner    -0.60
stall       -0.60
Positive Logits
nevertheless    1.34
nonetheless     1.22
anecd           0.86
luckily         0.80
efficients      0.76
ebus            0.73
poons           0.73
ulia            0.70
netflix         0.70
tsky            0.69
Activations Density: 0.273%