    Explanations

    phrases indicating a competitive or strategic benefit

    Negative Logits
    AddTagHelper        -0.88
     Wicidata           -0.80
    rungsseite          -0.77
    verwijspagina       -0.77
    ロウィン            -0.77
    Портали             -0.77
     desmotivaciones    -0.76
     indígen            -0.75
     للمعارف            -0.75
    majánló             -0.75
    Positive Logits
     advantage          0.69
    er                  0.69
     advantages         0.60
    .                   0.55
                        0.55
    ce                  0.55
    a                   0.54
    de                  0.54
    al                  0.54
    as                  0.53
    Activation Density: 0.233%

    No Known Activations