INDEX
    Explanations
    Negative Logits
    "}{*}{"          -0.49
    "yd"             -0.48
    "<i>"            -0.48
    "LT"             -0.44
    "hil"            -0.41
    "Hab"            -0.41
    "7"              -0.41
    " ["             -0.41
    "imageio"        -0.41
    "setOn"          -0.41
    Positive Logits
    " Duncan"         2.45
    "Duncan"          2.36
                      0.67
    " kapture"        0.66
    " miniaturka"     0.62
    " Angus"          0.61
    " Dunkin"         0.61
    "Angus"           0.61
    " turístico"      0.59
    " Ucraina"        0.59
    Activation Density: 0.001%

    No Known Activations