INDEX
    Explanations

    names of researchers and contributors in scientific articles

    New Auto-Interp
    Negative Logits
    unate
    -0.16
     chr
    -0.14
    olumn
    -0.14
    lius
    -0.14
    ียà¸ļ
    -0.14
    isz
    -0.14
     Benton
    -0.13
    etics
    -0.13
    gaard
    -0.13
    ¼
    -0.13
    POSITIVE LOGITS
     Fransa
    0.17
    ̣
    0.16
     Sailor
    0.16
     Ocak
    0.15
    ADX
    0.15
    iores
    0.14
     Shard
    0.14
    ACS
    0.14
    ãĥ¼ãĥŃ
    0.14
     Axis
    0.14
    Act Density 0.031%

    No Known Activations