INDEX
    Explanations

    proper nouns referring to individuals, likely names

    references to specific individuals, particularly "Gret" and related names

    Negative Logits
    shirt        -0.80
    xual         -0.78
    termin       -0.72
    shirts       -0.70
    panel        -0.69
     checkout    -0.67
    zees         -0.64
    REDACTED     -0.62
     Pigs        -0.62
     Jinping     -0.61
    Positive Logits
    alus         0.93
    sburg        0.89
    olf          0.84
    heim         0.80
    inka         0.79
    ür           0.79
    ald          0.78
    ersen        0.75
    alf          0.75
    ij           0.74
    Activation Density: 0.030%

    No Known Activations