    NEGATIVE LOGITS
    -           1.28
    ur          1.11
    ado         1.05
    ية          1.05
    is          0.99
    ды          0.98
    aneous      0.95
    них         0.92
    were        0.89
    nya         0.88
    POSITIVE LOGITS
    ت           1.15
     hydro      1.03
     hyd        1.01
     hydropon   0.96
     Hydro      0.95
    Đ           0.95
     n          0.95
    Hyd         0.93
    "           0.92
     on         0.91
    Activation Density: 0.009%

    No Known Activations