INDEX
    Explanations

    key events, shows, or performances mentioned in the text

    New Auto-Interp
    Negative Logits
    yon
    -0.15
    otu
    -0.14
    gradation
    -0.13
    ekli
    -0.13
    asso
    -0.13
    vrd
    -0.13
    олж
    -0.13
    ksi
    -0.13
    atern
    -0.13
    fad
    -0.13
    POSITIVE LOGITS
     features
    0.93
     feature
    0.90
    features
    0.81
     Features
    0.81
    feature
    0.77
    Features
    0.74
     Feature
    0.74
    -feature
    0.72
    Feature
    0.71
     FEATURES
    0.68
    Act Density 0.203%

    No Known Activations