INDEX
    Explanations

    references to classic music and entertainment

    New Auto-Interp
    Negative Logits
     classic
    -0.25
     classical
    -0.23
     Classic
    -0.23
    Classic
    -0.23
    CLASS
    -0.22
     klass
    -0.22
     Classical
    -0.21
    classic
    -0.21
     Classics
    -0.21
     classics
    -0.20
    POSITIVE LOGITS
    -era
    0.21
    ists
    0.21
    /mod
    0.21
    -rock
    0.19
    ical
    0.18
    ically
    0.18
    ediator
    0.18
     же
    0.18
    ism
    0.17
    icism
    0.17
    Act Density 0.020%

    No Known Activations