INDEX
    Explanations

    disclaimers and statements regarding political neutrality

    New Auto-Interp
    Negative Logits
    ean
    -0.14
    ipt
    -0.14
    ores
    -0.13
    ilog
    -0.13
    ï¼Ł↵↵
    -0.13
    èo
    -0.13
    igo
    -0.13
    cept
    -0.13
    isans
    -0.13
    angen
    -0.13
    POSITIVE LOGITS
     feel
    0.41
    Feel
    0.41
     Feel
    0.41
    feel
    0.37
     Enjoy
    0.32
     enjoy
    0.31
    Enjoy
    0.31
     hope
    0.30
     please
    0.29
    Please
    0.28
    Act Density 0.295%

    No Known Activations