INDEX
    Explanations

    quantitative data and statistical references

    Negative Logits
    "рави"        -0.15
    "567"         -0.15
    "illet"       -0.15
    "елич"        -0.15
    "arah"        -0.14
    "ину"         -0.14
    "ouv"         -0.14
    ".synthetic"  -0.14
    "其中"        -0.14
    "_TEXTURE"    -0.14
    Positive Logits
    " therefore"  0.21
    " because"    0.20
    " pois"       0.19
    "po"          0.19
    " pues"       0.19
    " mal"        0.18
    " puesto"     0.18
    " porque"     0.18
    " ni"         0.17
    "因为"        0.17
    Activation Density 0.092%

    No Known Activations