INDEX
    Explanations

    references to significant actions or events related to a narrative

    New Auto-Interp
    Negative Logits
    Sklici
    -0.81
    ագրություններ
    -0.73
    Manbalar
    -0.71
    -0.67
    tfrac
    -0.67
    Atsauces
    -0.62
    তথ্যসূত্র
    -0.61
    olingo
    -0.61
    Referencoj
    -0.59
    etheless
    -0.59
    POSITIVE LOGITS
     engraçadas
    0.61
    mtrl
    0.59
     cemeteries
    0.57
     meninos
    0.56
     gewel
    0.55
     fasi
    0.55
     sacrificed
    0.55
     différente
    0.55
     dunes
    0.54
     vinto
    0.54
    Act Density 0.155%

    No Known Activations