INDEX
    Explanations

    terms related to loss and disappointment

    New Auto-Interp
    Negative Logits
    ÙĬدة
    -0.15
    instead
    -0.15
    Ĵáŀ
    -0.15
     LENG
    -0.15
    servername
    -0.14
    .Euler
    -0.14
     thá»
    -0.14
     instead
    -0.14
    Td
    -0.13
    à¸ģว
    -0.13
    POSITIVE LOGITS
     due
    0.35
    due
    0.28
     altogether
    0.27
     Due
    0.23
     debido
    0.23
     because
    0.22
     بسبب
    0.21
     forever
    0.21
    Due
    0.21
    vido
    0.19
    Act Density 0.038%

    No Known Activations