INDEX
    Explanations

    doing things like listening to music

    New Auto-Interp
    Negative Logits
     Presumably
    0.47
     presumably
    0.46
     debemos
    0.42
    cerning
    0.41
     మేము
    0.40
     tivemos
    0.40
     puisque
    0.39
     etmektedir
    0.39
     lizenz
    0.39
     selaku
    0.38
    POSITIVE LOGITS
     অন্তত
    0.62
    至少
    0.62
     vài
    0.61
     થો
    0.60
    尽可能
    0.59
    哪怕
    0.59
    简单
    0.57
     yourself
    0.57
     almeno
    0.56
     কয়েক
    0.55
    Act Density 0.052%

    No Known Activations