INDEX
    Explanations

    mentions of the speaker's feelings or emotional experiences

    New Auto-Interp
    Negative Logits
     للاسماء
    -0.66
    -0.63
     Савезне
    -0.54
    MessageTagHelper
    -0.52
    adpleegd
    -0.51
    \{\\
    -0.49
     MonoBehaviour
    -0.49
    帖最后由
    -0.48
     Paglinawan
    -0.48
     ویکی‌پدی
    -0.47
    POSITIVE LOGITS
     amitié
    0.40
    NOPQRST
    0.38
     fieltro
    0.38
    loved
    0.38
     feltro
    0.37
     autorité
    0.37
    と感じ
    0.36
     miłości
    0.35
    août
    0.35
     comédie
    0.35
    Act Density 0.036%

    No Known Activations