INDEX
Negative Logits
Willow
-0.07
-0.06
-0.06
honor
-0.06
editor
-0.06
439
-0.06
Ar
-0.06
more
-0.06
собира
-0.06
AYOUT
-0.06
POSITIVE LOGITS
liquid
0.12
Liquid
0.10
Liquid
0.09
LCD
0.08
liqu
0.08
liquids
0.08
liquid
0.07
odu
0.07
ati
0.07
rita
0.07
Activations Density 0.009%