Explanations
terms and phrases indicating exceptions or contrasting conditions
Negative Logits
Token        Logit
Numerade     -0.47
besonder     -0.43
حوالہ        -0.42
brows        -0.42
RECOM        -0.40
Bekasi       -0.40
Française    -0.40
篤           -0.40
啪           -0.39
yyhl         -0.38
Positive Logits
Token        Logit
Malgré       0.50
尽管         0.50
Meskipun     0.47
Despite      0.45
Though       0.45
Despite      0.45
Meskipun     0.44
虽           0.43
Although     0.43
ostante      0.43
Activations Density: 0.545%