INDEX
    Explanations

    references to music and tunes

    Negative Logits
    Token             Logit
    "%)$"             -0.78
    "featureID"       -0.78
    " فريبيس"         -0.75
    ":])"             -0.72
    "jandra"          -0.72
    "ercises"         -0.71
    "rungsseite"      -0.69
    ")++;"            -0.68
    "()))"            -0.68
    "PerformLayout"   -0.68
    Positive Logits
    Token             Logit
    " tune"           1.22
    "Tune"            1.11
    " Tune"           1.08
    "tune"            0.93
    " tuned"          0.80
    " tuning"         0.75
    " Tunes"          0.69
    " tunes"          0.69
    "Tunes"           0.68
    "Hint"            0.67
    Activation Density: 0.136%

    No Known Activations