INDEX
Explanations
comparative terms indicating superiority or performance, especially in a political or economic context
Negative Logits
fal         -0.16
yclopedia   -0.16
cete        -0.15
icional     -0.14
zia         -0.14
duk         -0.14
ette        -0.14
ampo        -0.14
ُوا         -0.14
ulur        -0.14
Positive Logits
even        0.28
even        0.22
any         0.22
даже        0.19
than        0.18
甚至        0.18
EVEN        0.17
ever        0.17
anything    0.17
mere        0.17
Activations Density 0.147%