INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Böl
    -0.07
     toda
    -0.07
     meditation
    -0.07
    SEMB
    -0.06
     addition
    -0.06
     Myth
    -0.06
     blend
    -0.06
    PECT
    -0.06
     intuition
    -0.06
     jenom
    -0.06
    POSITIVE LOGITS
     car
    0.13
     Car
    0.12
    car
    0.11
    Car
    0.11
     cars
    0.10
    _car
    0.10
    .Car
    0.10
    -car
    0.09
     CAR
    0.09
    cars
    0.09
    Act Density 0.032%

    No Known Activations