INDEX
    Explanations

    references to individuals or groups in narratives

    New Auto-Interp
    Negative Logits
     Ing
    -0.17
    omp
    -0.15
    aurus
    -0.15
    زار
    -0.14
    vang
    -0.14
    omic
    -0.14
    LOCKS
    -0.14
    ilebilir
    -0.14
    ÈĽi
    -0.14
    ingen
    -0.14
    POSITIVE LOGITS
    tog
    0.15
    ziej
    0.15
    wand
    0.15
    licken
    0.15
    aln
    0.14
    codec
    0.14
    count
    0.14
    conc
    0.14
     Woj
    0.13
    asics
    0.13
    Act Density 0.122%

    No Known Activations