INDEX
    Explanations

    statements and reports from various sources

    New Auto-Interp
    Negative Logits
     Arab
    -0.15
    ewriter
    -0.14
    esel
    -0.14
    oser
    -0.13
    imates
    -0.13
    ormal
    -0.13
    ynom
    -0.13
     Silent
    -0.13
    avin
    -0.13
    opher
    -0.13
    POSITIVE LOGITS
    iges
    0.15
    508
    0.15
    ISA
    0.15
    -UA
    0.14
    íıŃ
    0.14
    inspace
    0.14
    577
    0.14
    lendi
    0.14
     SND
    0.14
    NAS
    0.13
    Act Density 0.049%

    No Known Activations