INDEX
    Explanations

    phrases that emphasize universality or collective experiences

    Negative Logits
    ThroughAttribute    -0.78
     myſelf             -0.77
    Pratique            -0.72
    contentLoaded       -0.71
    aarrggbb            -0.71
    Kaynakça            -0.70
     purpoſe            -0.69
    ToBounds            -0.69
     wireType           -0.68
    };*/                -0.67
    Positive Logits
    畢竟        0.86
    毕竟        0.85
    ведь        0.70
     hey        0.65
     why        0.64
     ведь       0.61
     isn        0.60
     what       0.58
    Ведь        0.58
    要知道      0.57
    Activation Density: 0.068%

    No Known Activations