INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     клави
    -0.78
     الطب
    -0.77
    weeney
    -0.73
    ácií
    -0.73
     Tapi
    -0.72
     matin
    -0.69
     Boi
    -0.69
     vro
    -0.67
    uelva
    -0.67
     عنوان
    -0.66
    POSITIVE LOGITS
     Baden
    1.51
     Scouting
    1.45
     scouting
    1.23
     Scout
    1.22
     scout
    1.20
    Baden
    1.16
     Scouts
    1.13
     Rover
    1.09
     scouts
    1.05
     Rovers
    1.05
    Act Density 0.012%

    No Known Activations