INDEX
    Explanations
    Negative Logits
    awtextra          -0.45
     oczeki           -0.43
     Bradley          -0.40
     EconPapers       -0.40
    pptx              -0.39
    hydrox            -0.39
    ยัง                -0.39
    wel               -0.38
     noDo             -0.36
    ăl                -0.36
    Positive Logits
    fillType          0.60
     BorderSide       0.58
    CloseOperation    0.52
    jooq              0.50
    getOut            0.49
     outward          0.48
     outbound         0.48
    serviceWorker     0.48
     outwards         0.48
    出去              0.47
    Activation Density: 0.004%

    No Known Activations