    NEGATIVE LOGITS
    ".sg"         -0.06
    " freezes"    -0.06
    "„"           -0.06
    " Judgment"   -0.06
    " Helpers"    -0.06
    "محمد"        -0.06
    "μένος"       -0.06
    "ený"         -0.06
                  -0.06
    " mãe"        -0.06
    POSITIVE LOGITS
    "BA"          0.09
    "ba"          0.08
    " arist"      0.07
                  0.07
    "fc"          0.06
    "abilia"      0.06
    " Carpet"     0.06
    "geometry"    0.06
    "Touchable"   0.06
    " bitmap"     0.06
    Activation Density: 0.001%

    No Known Activations