INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    adaptiveStyles
    -1.07
    s
    -0.86
     Wray
    -0.80
    imel
    -0.79
     calendriers
    -0.79
     Omer
    -0.78
     grun
    -0.77
    BeginContext
    -0.75
     Esquire
    -0.75
    σταση
    -0.74
    POSITIVE LOGITS
     Gorg
    0.86
     gall
    0.81
     Sorg
    0.80
     Bals
    0.79
    awtextra
    0.74
     postId
    0.71
     kaikk
    0.70
     Dz
    0.69
    ölf
    0.69
     NAZ
    0.67
    Act Density 1.004%

    No Known Activations