INDEX
Explanations
comparative phrases that indicate degrees or changes in concepts
phrases related to comparisons and timelines
New Auto-Interp
Negative Logits
Ế
-0.45
uar
-0.42
ždý
-0.41
ől
-0.41
giao
-0.41
espère
-0.41
uel
-0.40
assoluto
-0.39
raz
-0.39
forcément
-0.39
POSITIVE LOGITS
more
1.48
inkább
1.34
more
1.29
closer
1.25
скорее
1.22
raczej
1.22
vielmehr
1.20
More
1.19
eher
1.14
More
1.13
Activations Density 0.270%