INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Theſe
    -0.79
     ſmall
    -0.78
     للمعارف
    -0.77
     myſelf
    -0.75
     Monfieur
    -0.74
     iſt
    -0.73
     ―――――
    -0.73
     uſed
    -0.70
     itſelf
    -0.69
     diſt
    -0.69
    POSITIVE LOGITS
     lenker
    0.58
    +#+
    0.50
    ErrorException
    0.43
     seva
    0.42
     (
    0.42
    ograf
    0.42
    0.41
     Ther
    0.41
      
    0.41
     pinulongan
    0.40
    Act Density 0.091%

    No Known Activations