INDEX
    Explanations

    musical notations or symbols

    symbols, annotations, and music cues

    New Auto-Interp
    Negative Logits
    ynb
    -0.49
    🤯
    -0.47
    🫠
    -0.46
    ̄
    -0.45
    🤦
    -0.44
    🤬
    -0.43
    🥱
    -0.43
    🤮
    -0.42
     🤦
    -0.42
    ftagPool
    -0.42
    POSITIVE LOGITS
    1.07
    0.93
    0.91
    0.90
     ★
    0.89
     ►
    0.88
    0.87
    0.86
    0.85
    0.83
    Act Density 0.019%

    No Known Activations