INDEX
    Explanations

    the word "only" in various contexts

    New Auto-Interp
    Negative Logits
     externi
    -0.58
     oppure
    -0.53
     meurt
    -0.52
     magari
    -0.50
     fuoco
    -0.50
     véhic
    -0.49
     bower
    -0.49
    多人
    -0.49
     souhaite
    -0.49
     Nähe
    -0.49
    POSITIVE LOGITS
    satunya
    1.32
     jedin
    0.99
    唯一的
    0.93
    唯一
    0.87
     enige
    0.86
     eneste
    0.81
     einzigen
    0.78
     sole
    0.77
     Signalez
    0.76
     един
    0.76
    Act Density 0.160%

    No Known Activations