INDEX
    Explanations

    personal opinions

    New Auto-Interp
    Negative Logits
     autorytatywna
    -0.78
    تقاوى
    -0.76
    expandindo
    -0.76
     beginnetje
    -0.73
    EDEFAULT
    -0.71
    -0.70
    Enllaces
    -0.70
    :✨
    -0.69
     ویکی‌پدی
    -0.69
    awaiter
    -0.69
    POSITIVE LOGITS
     ſtand
    0.77
     deſt
    0.72
     faſt
    0.69
    ſelves
    0.64
     ſet
    0.63
     fhew
    0.63
     themſelves
    0.63
     cauſe
    0.62
     juſ
    0.62
    zeba
    0.61
    Act Density 0.178%

    No Known Activations