INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    dez
    -0.19
    nie
    -0.17
    mae
    -0.16
    unker
    -0.16
    merc
    -0.15
    gar
    -0.15
     çĿ
    -0.15
    gor
    -0.14
    xes
    -0.14
    nya
    -0.14
    POSITIVE LOGITS
     Rico
    0.26
     Rican
    0.20
    ekl
    0.17
     Ric
    0.16
    quoi
    0.16
     rico
    0.16
     Island
    0.15
    POSE
    0.15
     Morrow
    0.15
    PR
    0.14
    Act Density 0.003%

    No Known Activations