INDEX
    Explanations

    references to the United States

    New Auto-Interp
    Negative Logits
    -0.54
    nung
    -0.47
    Bakgrunnsstoff
    -0.46
    ContentValues
    -0.45
     ngang
    -0.45
    uera
    -0.45
    ẨM
    -0.44
    ynthetic
    -0.44
    indd
    -0.43
     coagulation
    -0.43
    POSITIVE LOGITS
    stdc
    0.58
    Facades
    0.57
    μφωνα
    0.54
    日閲覧
    0.54
    новниш
    0.53
    __":
    
    0.53
     متحده
    0.52
    TagHelper
    0.52
    valdi
    0.51
     Vikipedi
    0.51
    Act Density 0.099%

    No Known Activations