INDEX
    Explanations

    occurrences of specific noun forms or phrases related to artistic reviews

    Negative Logits
    æľĹ  -0.15
    lasses  -0.15
     Romero  -0.15
    wm  -0.14
     ciz  -0.14
    olio  -0.14
    eton  -0.14
    iddi  -0.14
    ederland  -0.14
    ifton  -0.14
    Positive Logits
     by  0.17
     mans  0.16
     sa  0.16
    d  0.15
    ous  0.15
     linear  0.15
    rag  0.15
    Ïģθ  0.14
    andel  0.14
     starting  0.14
    Act Density 0.002%

    No Known Activations