    Negative Logits
    Token            Logit
    ' `'             0.57
    '-'              0.53
    ' *'             0.46
    ' mean'          0.45
    ' initial'       0.45
    'a'              0.45
    ' ultimate'      0.45
    ' technical'     0.44
    ' "'             0.44
    ' ultimately'    0.42
    Positive Logits
    Token            Logit
    ' Girl'          0.82
    ' drummer'       0.80
    ' Girls'         0.79
    ' guitarist'     0.76
    'Girls'          0.76
    ' girl'          0.75
    ' menina'        0.75
    ' Mädchen'       0.75
    'Girl'           0.74
    '女孩'           0.74
    Activation Density: 0.104%

    No Known Activations