INDEX
    Explanations

    references to quantities or measurements

    Negative Logits
    Token              Logit
    "зар"              -0.60
    " Addis"           -0.59
    " Lira"            -0.53
    " Væ"              -0.53
    ";"                -0.53
    "Mark"             -0.52
    " écrite"          -0.52
    (no token shown)   -0.51
    " Obrigado"        -0.50
    "work"             -0.50
    Positive Logits
    Token              Logit
    " OF"              1.44
    " Of"              1.32
    " of"              1.29
    "Of"               1.23
    ".)}"              1.20
    "OF"               1.12
    ".}("              1.10
    " của"             1.06
    "of"               1.05
    "オブ"             1.05
    Activation Density: 1.588%

    No Known Activations