INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Strike
    -0.06
    -0.06
    )s
    -0.06
    Τ
    -0.06
     careful
    -0.06
     setback
    -0.06
    .Act
    -0.06
     office
    -0.06
    �u
    -0.06
     hdc
    -0.06
    POSITIVE LOGITS
    ogenous
    0.07
    Contrib
    0.07
    vez
    0.07
    <Client
    0.07
    .Encode
    0.06
     المن
    0.06
     имп
    0.06
     espan
    0.06
     чуж
    0.06
    origin
    0.06
    Act Density 0.004%

    No Known Activations