INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    و
    1.24
     to
    1.02
    ни
    0.94
    не
    0.91
    м
    0.89
    ले
    0.87
    to
    0.85
    ται
    0.85
    0.85
    ő
    0.80
    POSITIVE LOGITS
    s
    1.38
    a
    1.09
     vaccin
    0.98
    0.95
    0.94
     fiori
    0.90
     aviones
    0.89
    0.89
    药物
    0.87
     poziom
    0.86
    Act Density 0.005%

    No Known Activations