INDEX
    Explanations

    instances of numerical data and statistical scores

    Negative Logits
    Token        Logit
     Manson      -0.17
    åĪ·          -0.16
    ĩ¼           -0.16
    eri          -0.15
     Mellon      -0.15
     Eleanor     -0.15
    onet         -0.15
     Caldwell    -0.14
    رÙĪØ´        -0.14
     Barcl       -0.14

    Positive Logits
    Token        Logit
    102          0.36
    103          0.34
    104          0.32
    303          0.27
    204          0.25
    403          0.25
    105          0.25
    302          0.24
    402          0.24
    03           0.24
    Activation Density: 0.023%

    No Known Activations