INDEX
Explanations
comparative phrases that highlight differences or contrasts
New Auto-Interp
Negative Logits
Patria
-0.71
onix
-0.70
bufio
-0.70
Melayu
-0.69
HSP
-0.69
Bier
-0.68
ostavi
-0.68
Jeong
-0.67
ěte
-0.65
ous
-0.63
POSITIVE LOGITS
THAN
1.95
than
1.77
Than
1.59
Than
1.52
THAN
1.35
than
1.31
än
1.26
niż
1.18
decât
1.18
než
1.16
Activations Density 0.149%