INDEX
    Explanations

    references to news sources or reporting entities in articles

    New Auto-Interp
    Negative Logits
    גר
    -0.41
    taza
    -0.41
     même
    -0.40
     mêmes
    -0.39
    </em>
    -0.39
     adult
    -0.38
     ar
    -0.38
     normaux
    -0.38
    -0.38
     stesse
    -0.37
    POSITIVE LOGITS
     nakalista
    1.03
    SequentialGroup
    0.92
    Personendaten
    0.82
    InjectAttribute
    0.81
    IANS
    0.80
     виправивши
    0.80
    rungsseite
    0.78
     समीक्षक
    0.77
     saites
    0.76
     CreateTagHelper
    0.76
    Act Density 0.009%

    No Known Activations