INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     вопрос
    0.47
     नुक
    0.44
    roffen
    0.43
    🤌
    0.43
    0.42
     ультра
    0.41
    ZUKI
    0.41
     नव्या
    0.41
     மின்ன
    0.41
    0.41
    POSITIVE LOGITS
     acol
    0.45
     stewardship
    0.44
     restful
    0.42
     dominate
    0.41
     participate
    0.41
     quiet
    0.40
     contribute
    0.40
     asynchronous
    0.40
     passive
    0.40
    ものを
    0.40
    Act Density 0.003%

    No Known Activations