INDEX
    Explanations
    No Explanations Found
    Negative Logits
    Token           Logit
    "ropri"         -0.67
    "person"        -0.67
    "ela"           -0.67
    "woman"         -0.65
    "gal"           -0.62
    " Sounds"       -0.62
    "ZI"            -0.62
    "wife"          -0.61
    "Si"            -0.61
    "cow"           -0.61
    Positive Logits
    Token           Logit
    "unker"          0.71
    "ģĸ"             0.69
    " Thunderbolt"   0.67
    "acers"          0.67
    " Seraph"        0.66
    "leep"           0.66
    " Archangel"     0.66
    "phrine"         0.65
    "cipled"         0.65
    "ĸļ"             0.65
    Activation Density: 0.000%

    No Known Activations

    This feature has no known activations.