INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     eventuele
    -0.10
     dingen
    -0.09
     schle
    -0.08
     verwachting
    -0.08
     eventueel
    -0.08
    至少
    -0.08
     houding
    -0.08
     chéile
    -0.08
     श्र
    -0.07
     hacia
    -0.07
    POSITIVE LOGITS
     neuron
    0.08
     CEOs
    0.08
     Ever
    0.07
     NPR
    0.07
    Knight
    0.07
    Unexpected
    0.07
     Knight
    0.07
     Gur
    0.07
    Governor
    0.07
     Nobel
    0.07
    Act Density 0.071%

    No Known Activations