    Negative Logits
    Token          Logit
    " another"     -0.99
    "another"      -0.79
    "saraba"       -0.75
    " ANOTHER"     -0.70
    "otro"         -0.68
    " Another"     -0.67
    "hematical"    -0.67
    "følgelig"     -0.67
    "อีก"           -0.66
    "Another"      -0.65
    Positive Logits
    Token              Logit
    "er"               0.54
    "PerformLayout"    0.54
    "ка"               0.52
    "Predecesor"       0.52
    "ple"              0.50
    " organism"        0.50
    " Esperanto"       0.50
    "o"                0.50
    "чок"              0.49
    " entity"          0.49
    Activation Density: 0.059%

    No Known Activations