INDEX
Explanations
contrastive phrases and qualifiers that indicate skepticism or critique
concessions and contrasts
New Auto-Interp
Negative Logits
RegressionTest
-0.56
Espèce
-0.40
省市镇
-0.39
bnf
-0.36
setof
-0.35
Hor
-0.34
anguardia
-0.34
izop
-0.34
UIControlState
-0.34
nestjs
-0.33
POSITIVE LOGITS
nonetheless
0.65
'\\;'
0.56
nevertheless
0.56
终究
0.56
0.52
Nonetheless
0.52
Dennoch
0.52
enderror
0.50
Nonetheless
0.50
AttributeSet
0.50
Activations Density 0.072%