INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ной
    0.95
    ى
    0.93
    ле
    0.92
    లు
    0.89
    лся
    0.89
    г
    0.86
     حسين
    0.85
    ю
    0.84
    но
    0.81
    يز
    0.79
    POSITIVE LOGITS
    founded
    1.01
    4
    0.97
    ato
    0.91
    a
    0.90
    ma
    0.90
     Entrepreneur
    0.89
     entrepreneur
    0.88
    0
    0.88
    -
    0.88
    ers
    0.88
    Act Density 0.009%

    No Known Activations