INDEX
    Explanations

    references to quantity and collective experiences

    New Auto-Interp
    Negative Logits
    aira
    -0.17
    isses
    -0.16
    jec
    -0.14
     Evet
    -0.14
    bach
    -0.14
    ager
    -0.13
    ëĭ¥
    -0.13
    asts
    -0.13
    ères
    -0.13
    orrh
    -0.13
    POSITIVE LOGITS
     ones
    0.24
     Ones
    0.21
     them
    0.19
     are
    0.19
    ones
    0.18
    them
    0.17
     were
    0.16
     others
    0.16
    avou
    0.15
     Ø¢ÙĨÙĩا
    0.15
    Act Density 0.115%

    No Known Activations