INDEX
Explanations
instances of the word "only" to emphasize limitations or exclusivity
New Auto-Interp
Negative Logits
bourg
-0.15
antt
-0.14
ypes
-0.14
пÑĥÑĤ
-0.14
stry
-0.14
ldkf
-0.14
arges
-0.13
lement
-0.13
Petty
-0.13
ant
-0.13
POSITIVE LOGITS
udo
0.17
ìķĻ
0.14
oth
0.14
ем
0.14
.Constraint
0.14
fans
0.14
beg
0.14
armac
0.14
heim
0.14
ament
0.14
Activations Density 0.076%