INDEX
Explanations
phrases related to the condition and characteristics of physical items for sale
New Auto-Interp
Negative Logits
«
-0.56
her
-0.51
‘
-0.49
inter
-0.49
de
-0.46
«
-0.46
sur
-0.45
fore
-0.45
fa
-0.44
pe
-0.44
POSITIVE LOGITS
itſelf
1.10
ſever
1.09
myſelf
1.08
ſtate
1.04
Diſ
1.04
reaſon
1.03
Monfieur
0.99
<=",
0.99
Anſ
0.98
pleaſure
0.96
Activations Density 0.334%