INDEX
    Explanations

    verbs indicating actions and experiences, particularly in a personal or historical context

    New Auto-Interp
    Negative Logits
    ſelves
    -0.68
     يتيمه
    -0.58
    kpop
    -0.56
    "}"
    -0.55
     Mult
    -0.54
     houſe
    -0.54
    __))
    -0.53
    ृत
    -0.53
    ("#{
    -0.53
    τέλε
    -0.53
    POSITIVE LOGITS
     himself
    1.40
     his
    1.32
     him
    1.16
    himself
    1.07
     His
    1.05
     He
    1.05
    His
    0.98
     Himself
    0.97
     he
    0.91
    的他
    0.91
    Act Density 0.711%

    No Known Activations