INDEX
    Explanations

    negatively-toned phrases related to caring or boredom

    indifference

    New Auto-Interp
    Negative Logits
    <bos>
    -0.57
    ,
    -0.50
     handleMessage
    -0.44
    ?
    -0.42
    rau
    -0.42
    tov
    -0.42
    ToInt
    -0.41
    .
    -0.41
     предпо
    -0.40
    ift
    -0.40
    POSITIVE LOGITS
     متعلقه
    0.91
    MLLoader
    0.91
     Мексичка
    0.90
    Personendaten
    0.89
    ChildScrollView
    0.88
     فريبيس
    0.87
    SourceChecksum
    0.85
    TagMode
    0.82
     Chwiliwch
    0.81
     myſelf
    0.79
    Act Density 0.734%

    No Known Activations