    NEGATIVE LOGITS
    "الحياه"  -0.64
    " Baillargeon"  -0.59
    " ivelany"  -0.57
    " Kühl"  -0.53
    "strokeWeight"  -0.51
    "WSER"  -0.51
    "τουργ"  -0.51
    " egret"  -0.51
    "raisemb"  -0.50
    "niosek"  -0.50
    POSITIVE LOGITS
    " Fix"  0.83
    " Stuck"  0.82
    " stuck"  0.82
    "Fix"  0.80
    "Stuck"  0.79
    " fix"  0.79
    " locked"  0.76
    "fix"  0.73
    " fixed"  0.73
    " fixing"  0.69
    Activation Density: 0.225%

    No Known Activations