INDEX
Explanations
comparative expressions related to numerical values or measurements
New Auto-Interp
Negative Logits
eniable
-0.17
sWith
-0.17
overall
-0.16
phe
-0.15
more
-0.15
fewer
-0.15
itz
-0.15
rim
-0.14
ares
-0.14
eden
-0.14
POSITIVE LOGITS
than
0.44
Than
0.34
Than
0.33
_than
0.33
than
0.32
THAN
0.30
äºİ
0.29
än
0.25
_THAN
0.23
æĸ¼
0.23
Activations Density 0.060%