INDEX
Explanations
phrases indicating a comparison or relationship between entities
Negative Logits
enfans      -0.70
mstyle      -0.61
âmes        -0.52
auroit      -0.52
ſelf        -0.51
pleaſure    -0.51
vœux        -0.49
NonQuery    -0.48
feroit      -0.46
Ruß         -0.46
Positive Logits
between     1.61
between     1.48
Between     1.46
Between     1.45
BETWEEN     1.35
BETWEEN     1.31
между       1.27
zwischen    1.26
між         1.16
tussen      1.09
Activations Density 0.141%