INDEX
Explanations
phrases related to evaluation and comparative analysis
New Auto-Interp
Negative Logits
but
-0.28
omo
-0.22
however
-0.20
ellar
-0.18
aber
-0.18
nhưng
-0.18
but
-0.18
But
-0.17
maar
-0.17
ä½Ĩ
-0.17
POSITIVE LOGITS
isku
0.15
#ab
0.14
baugh
0.14
оÑĥ
0.14
олож
0.13
stadt
0.13
nonetheless
0.13
è¿ĺæĺ¯
0.13
utory
0.13
å½
0.13
Activations Density 0.061%