INDEX
    Explanations

    references to HTML and XML attributes

    Negative Logits
    Token               Logit
    " idéia"            -0.45
    " Vernunft"         -0.41
    " Bühne"            -0.40
    " Öffentlichkeit"   -0.40
    " Ciencia"          -0.39
    "occasione"         -0.37
    " Verpflichtung"    -0.37
    " Schritt"          -0.37
    " Botschaft"        -0.36
    " Verantwortung"    -0.36

    Positive Logits
    Token               Logit
    " attribute"         1.16
    " att"               1.10
    " Attribute"         1.05
    "attr"               1.03
    " attr"              1.03
    "att"                0.98
    "attribute"          0.96
    " Att"               0.94
    "Att"                0.91
    " attribu"           0.89
    Act Density 0.250%

    No Known Activations