INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    f
    1.39
    1.35
    ।.
    1.20
    o
    1.13
    יי
    1.09
    ו
    1.01
    いた
    0.99
    ка
    0.97
    もの
    0.97
    тра
    0.96
    POSITIVE LOGITS
     feet
    1.38
     be
    1.17
     as
    1.17
     foot
    1.15
     are
    1.04
     (
    1.00
     her
    0.97
     footprints
    0.96
     or
    0.95
     an
    0.94
    Act Density 0.046%

    No Known Activations