Explanations
qualifying adjectives and adverbs
Negative Logits
Token     Logit
but       4.42
but       4.01
nhưng     3.69
But       3.67
pero      3.64
但        3.62
But       3.54
ولكن      3.53
लेकिन     3.50
אך        3.28
Positive Logits
Token        Logit
మాత్రం       1.18
그걸         0.82
بعدها        0.72
afterwards   0.66
によっては   0.66
)):          0.65
nocześnie    0.65
வேற          0.65
其余         0.63
remaining    0.61
Activations Density: 0.181%