INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    homonymie
    -0.54
    (!__
    -0.52
    WIRE
    -0.48
    Gene
    -0.44
     Gene
    -0.44
    няка
    -0.44
    >{@
    -0.42
    firewall
    -0.42
     Wire
    -0.41
    Історія
    -0.41
    POSITIVE LOGITS
    GraphicsUnit
    0.66
     Normdatei
    0.65
    mability
    0.61
     onCancelled
    0.61
    gants
    0.60
    RTGC
    0.58
    writeField
    0.57
    antd
    0.56
    oweit
    0.56
    reddits
    0.55
    Act Density 0.002%

    No Known Activations