INDEX
    Explanations

    mentions of personal experiences and actions in a conversational context

    first-person actions or reflections

    New Auto-Interp
    Negative Logits
    stateProvider
    -0.42
     themselves
    -0.41
    Poloha
    -0.39
    InstrumentedTest
    -0.39
     Compound
    -0.39
    Aiheesta
    -0.37
     they
    -0.36
     cose
    -0.35
     dosage
    -0.35
     disambiguazione
    -0.35
    POSITIVE LOGITS
     myself
    1.04
    myself
    0.96
     Myself
    0.85
    Myself
    0.84
     my
    0.74
     minhas
    0.68
     myſelf
    0.67
    我自己
    0.65
     mijn
    0.64
     meinem
    0.63
    Act Density 2.676%

    No Known Activations