INDEX
    Explanations
    Negative Logits
    saco            -0.50
    +#+#            -0.49
     podjela        -0.47
     noDo           -0.45
    lsa             -0.44
    kegaard         -0.43
    httphttps       -0.42
    💼              -0.42
     przecież       -0.42
     @"/            -0.42
    Positive Logits
    rungsseite      0.46
    tribal          0.41
    temun           0.40
    úgó             0.40
    pantalón        0.39
    soil            0.38
     mayor          0.36
     Tester         0.35
     fisherman      0.35
     Hooks          0.35
    Activation Density: 0.013%

    No Known Activations