    Explanations

    references to "mind" and related concepts involving thought processes or mental states

    Negative Logits
    мир
    -0.16
    instein
    -0.15
    baugh
    -0.15
    stoff
    -0.15
    banks
    -0.14
     Micha
    -0.14
    typed
    -0.14
    åĵ
    -0.14
     вне
    -0.14
    anou
    -0.13
    Positive Logits
    かの
    0.15
    perty
    0.15
    ерів
    0.15
     rekl
    0.14
    .eye
    0.14
    perfect
    0.14
    pies
    0.14
     circulation
    0.13
    /plugins
    0.13
    langs
    0.13
    Activation Density: 0.014%

    No Known Activations