INDEX
    NEGATIVE LOGITS
    Token                   Logit
    "ين"                    1.05
    "of"                    1.02
    " zwią"                 0.98
    "де"                    0.95
    " are"                  0.94
    " electrón"             0.92
    "com"                   0.91
    "as"                    0.89
    (token not rendered)    0.88
    (token not rendered)    0.87
    POSITIVE LOGITS
    Token         Logit
    " I"          1.28
    " Church"     1.02
    " \"          1.00
    "ot"          0.99
    "ون"          0.95
    "тэй"         0.93
    "ود"          0.91
    " not"        0.90
    "ره"          0.83
    "B"           0.79
    Activation Density: 0.002%

    No Known Activations