INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    gah
    -0.46
     RAM
    -0.42
     Alla
    -0.40
    ául
    -0.40
    emo
    -0.40
    év
    -0.40
    gum
    -0.39
    hall
    -0.39
    ень
    -0.39
    ства
    -0.39
    POSITIVE LOGITS
     ویکی‌پدی
    0.84
     صوتيه
    0.82
     CreateTagHelper
    0.80
     فريبيس
    0.78
     ModelExpression
    0.76
    GEBURTSDATUM
    0.75
     виправивши
    0.75
    mögens
    0.73
    +#+#
    0.72
     EconPapers
    0.71
    Act Density 0.054%

    No Known Activations