INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    Silence
    0.77
     Pathfinder
    0.74
     ومن
    0.74
     Productivity
    0.72
     اعظم
    0.71
    subfloat
    0.69
    calling
    0.69
     Silence
    0.69
     Speech
    0.68
    platform
    0.68
    POSITIVE LOGITS
     votes
    0.72
    性質
    0.71
     لائن
    0.69
    Cele
    0.69
     behandeling
    0.66
    を受ける
    0.64
    line
    0.63
     leta
    0.63
     lisse
    0.63
    제를
    0.62
    Act Density 0.016%

    No Known Activations