INDEX
    Explanations

    references to general items or concepts

    New Auto-Interp
    Negative Logits
     lágrimas
    -0.84
     gales
    -0.84
     hanem
    -0.83
     Heber
    -0.82
    UpInside
    -0.82
     larmes
    -0.82
    adecimal
    -0.80
     Sapphire
    -0.78
     วาด
    -0.78
     fhew
    -0.78
    POSITIVE LOGITS
     things
    2.66
    Things
    2.31
     Things
    2.30
     THINGS
    2.22
     thing
    2.18
    things
    2.14
     Thing
    1.95
     THING
    1.85
    Thing
    1.78
    THINGS
    1.75
    Act Density 0.060%

    No Known Activations