INDEX
    Explanations

    references to abstract concepts and existential inquiries

    "...thing" or "...thing."

    New Auto-Interp
    Negative Logits
     none
    -0.48
    none
    -0.46
    None
    -0.46
    rans
    -0.43
    ритори
    -0.43
    endwhile
    -0.43
    vyn
    -0.42
    ともに
    -0.41
    CreateModel
    -0.41
    TemporalType
    -0.40
    POSITIVE LOGITS
     thing
    3.78
     THING
    2.85
    thing
    2.79
     things
    2.66
     Thing
    2.66
    Thing
    2.50
     Things
    2.25
    things
    2.20
     THINGS
    2.19
    Things
    2.19
    Act Density 0.262%

    No Known Activations