    NEGATIVE LOGITS
    "ftagPool"          -0.65
    " Chwiliwch"        -0.64
    " الحره"            -0.64
    "RectangleBorder"   -0.63
    "DockStyle"         -0.62
    "BufferException"   -0.62
    "MLLoader"          -0.62
    "原始内容存档于"    -0.57
    " Décès"            -0.57
    " новниш"           -0.56
    POSITIVE LOGITS
    "being"       0.64
    "Being"       0.63
    " being"      0.56
    " BEING"      0.55
    " Being"      0.54
    "Becoming"    0.51
    " להיות"      0.45
    " becoming"   0.45
    " Becoming"   0.42
    "becoming"    0.42
    Activation Density: 0.012%

    No Known Activations