INDEX
    Explanations

    similarity or necessity

    New Auto-Interp
    Negative Logits
     necessarily
    -1.24
    necessarily
    -1.18
     necesariamente
    -1.16
     nécessairement
    -1.05
     necessariamente
    -1.02
    generally
    -0.94
     generally
    -0.91
     généralement
    -0.82
     potentially
    -0.79
     generalmente
    -0.79
    POSITIVE LOGITS
    ẵn
    0.55
     old
    0.53
    ूर
    0.52
    بح
    0.52
     answer
    0.51
     odd
    0.51
     letzter
    0.51
     لص
    0.50
    ::_('
    0.50
     FUNC
    0.50
    Act Density 0.035%

    No Known Activations