INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     einerseits
    -0.68
    contentLoaded
    -0.62
     متعلقه
    -0.60
    tvguidetime
    -0.59
    thansa
    -0.59
     kasarigan
    -0.58
    adpleegd
    -0.58
    EndGlobalSection
    -0.57
    thâu
    -0.57
    homonymie
    -0.57
    POSITIVE LOGITS
     or
    0.95
     atau
    0.66
    Alternatively
    0.65
     или
    0.64
     Alternatively
    0.63
     oder
    0.60
     outright
    0.60
     또는
    0.58
     หรือ
    0.58
    OrCreate
    0.57
    Act Density 0.030%

    No Known Activations