INDEX
Explanations
phrases indicating comparative language, particularly in the context of abundance or satisfaction
New Auto-Interp
Negative Logits
lam
-0.19
ini
-0.16
386
-0.14
Fritz
-0.14
ours
-0.14
toll
-0.14
.DataAccess
-0.13
ê¼
-0.13
ermen
-0.13
_stderr
-0.13
POSITIVE LOGITS
than
0.55
than
0.45
-than
0.44
Than
0.40
THAN
0.39
Than
0.37
_than
0.34
než
0.30
än
0.29
more
0.26
Activations Density 0.019%