INDEX
    Explanations

    features of cultural artifacts and historical contexts

    New Auto-Interp
    Negative Logits
    otime
    -0.16
    intree
    -0.16
    nici
    -0.16
    asmus
    -0.16
    achi
    -0.16
    ATOM
    -0.15
    creativecommons
    -0.15
    ervas
    -0.15
     vein
    -0.15
    eros
    -0.14
    POSITIVE LOGITS
    hev
    0.16
    aller
    0.15
    stamp
    0.15
     Magazine
    0.15
    edo
    0.14
    osy
    0.14
     Dy
    0.14
    Ctrls
    0.13
     converse
    0.13
     stamp
    0.13
    Act Density 0.236%

    No Known Activations