    Negative Logits
     parfüm
    -2.03
     宣传
    -1.78
    #%%
    
    -1.77
    -1.74
    ##############
    -1.73
     flesta
    -1.70
    -1.70
     denkt
    -1.70
    🤣🤣🤣
    -1.68
    -1.68
    Positive Logits
     bordering
    1.66
     Thankfully
    1.60
    4
    1.58
     them
    1.56
     Obviously
    1.56
     bestowed
    1.55
     trochę
    1.54
    ).
    1.50
     him
    1.47
    <i>
    1.45
    Activation Density: 24.680%

    No Known Activations