INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     C
    0.90
    на
    0.89
    on
    0.86
     H
    0.82
    ل
    0.79
     U
    0.79
    ни
    0.79
    ص
    0.78
    daki
    0.77
    ף
    0.77
    POSITIVE LOGITS
    Soviet
    1.34
     소련
    1.10
     Soviet
    1.09
     সোভ
    0.97
     Soviets
    0.92
    O
    0.89
     soviet
    0.87
    Russia
    0.87
     совет
    0.85
    苏联
    0.84
    Act Density 0.004%

    No Known Activations