INDEX
Explanations
references to HTML and XML attributes
NEGATIVE LOGITS
Token           Logit
idéia           -0.45
Vernunft        -0.41
Bühne           -0.40
Öffentlichkeit  -0.40
Ciencia         -0.39
occasione       -0.37
Verpflichtung   -0.37
Schritt         -0.37
Botschaft       -0.36
Verantwortung   -0.36
POSITIVE LOGITS
Token           Logit
attribute       1.16
att             1.10
Attribute       1.05
attr            1.03
attr            1.03
att             0.98
attribute       0.96
Att             0.94
Att             0.91
attribu         0.89
Activations Density 0.250%