    Negative Logits
    Token          Logit
    "νονται"       -0.92
    "Alzheimer"    -0.91
    " tiennent"    -0.90
    "cés"          -0.89
    " oslo"        -0.88
    "osť"          -0.87
    " cultura"     -0.87
    "そこは"       -0.86
    "러나"         -0.85
    " vagas"       -0.85
    Positive Logits
    Token              Logit
    " or"              1.56
    " this"            1.33
    " any"             1.13
    " use"             1.05
    " activities"      0.99
    " arise"           0.99
    " transactions"    0.98
    " arises"          0.98
    "การ"              0.98
    " ANYTHING"        0.98
    Activation Density: 0.016%

    No Known Activations