INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    comment
    -0.07
    ibern
    -0.07
     elephant
    -0.07
     BETWEEN
    -0.07
     пацієн
    -0.07
     แล
    -0.07
    евич
    -0.07
     Palestinian
    -0.07
    uisine
    -0.07
    	direction
    -0.06
    POSITIVE LOGITS
     core
    0.18
     Core
    0.16
     CORE
    0.14
    Core
    0.13
    core
    0.12
    CORE
    0.12
    -Core
    0.11
    .Core
    0.10
     cores
    0.10
     Corey
    0.10
    Act Density 0.017%

    No Known Activations