INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    /#{
    -0.81
    =#{
    -0.80
     ';
    
    -0.69
    chel
    -0.68
     ‘
    -0.67
    hel
    -0.66
     Lud
    -0.66
     “
    -0.66
     Hol
    -0.65
    rea
    -0.65
    POSITIVE LOGITS
     argint
    0.93
     ainfi
    0.93
     econômica
    0.92
     تانيه
    0.92
     plufieurs
    0.91
     policiales
    0.87
     feroit
    0.86
    httphttps
    0.86
     للمعارف
    0.86
     calitate
    0.85
    Act Density 0.003%

    No Known Activations