    Negative Logits
    Token             Logit
    " combined"       -0.06
    " Little"         -0.06
    " conced"         -0.06
    "ynch"            -0.06
    " curs"           -0.06
    " kitab"          -0.06
    "ycop"            -0.06
    " krist"          -0.06
    (token missing)   -0.06
    " Salvador"       -0.06
    Positive Logits
    Token             Logit
    "لاق"             0.07
    " Parm"           0.06
    " pasture"        0.06
    " Builds"         0.06
    "では"            0.06
    "alarından"       0.06
    " νεφοκ"          0.06
    " primer"         0.06
    " overlook"       0.06
    (token missing)   0.06
    Act Density 0.014%

    No Known Activations