INDEX
    Explanations

    statements indicating meaning or implications

    New Auto-Interp
    Negative Logits
     sight
    -0.57
    W
    -0.55
     hä
    -0.46
    ub
    -0.46
    Kob
    -0.46
    Cordialement
    -0.45
    自行
    -0.44
    py
    -0.43
     trends
    -0.43
    ynomial
    -0.42
    POSITIVE LOGITS
     مرئيه
    1.09
     للاسماء
    0.96
     MEANS
    0.93
    意味着
    0.92
     means
    0.91
    évaluateur
    0.88
    DockStyle
    0.88
     Means
    0.85
     artinya
    0.83
     gynhyrchwyd
    0.83
    Act Density 0.210%

    No Known Activations