INDEX
    Explanations

    parts of phrases indicating personal statements or opinions

    New Auto-Interp
    Negative Logits
    lech
    -0.15
    å¸Į
    -0.14
    efs
    -0.14
    oui
    -0.14
    rior
    -0.14
    .ide
    -0.14
    687
    -0.14
    ÏĥÏĦο
    -0.14
    éŀ
    -0.13
    ej
    -0.13
    POSITIVE LOGITS
    //{{
    0.17
    inus
    0.16
    336
    0.16
    ChangeListener
    0.15
    arger
    0.15
    anuts
    0.15
    InlineData
    0.15
    sWith
    0.14
     Ends
    0.14
     деÑĢев
    0.14
    Act Density 0.021%

    No Known Activations