INDEX
    Explanations

    strong expressions of emotion and enthusiasm in responses

    New Auto-Interp
    Negative Logits
     незавершена
    -1.20
     nakalista
    -1.16
     صوتيه
    -1.06
    -1.04
    AccessorTable
    -1.02
     Roskov
    -1.02
     estekak
    -1.01
     виправивши
    -0.99
     Normdatei
    -0.98
    twimg
    -0.97
    POSITIVE LOGITS
      
    0.56
    0.46
    <eos>
    0.44
     http
    0.44
     it
    0.43
    _
    0.43
    ↵↵
    0.42
    [
    0.42
    0.40
    0.40
    Act Density 0.280%

    No Known Activations