INDEX
    Explanations
    Negative Logits
    Token             Logit
    " very"           -0.95
    "kilde"           -0.94
    " perhaps"        -0.89
    " burgeoning"     -0.88
    "!"               -0.88
    " quaint"         -0.83
    " agus"           -0.83
    " great"          -0.83
    "lateinit"        -0.82
    " vanwege"        -0.82
    Positive Logits
    Token             Logit
    "arbeiten"        1.06
    " griechischen"   1.03
    ">(),"            1.02
    " Dlatego"        1.00
    " internetu"      1.00
    " jednom"         1.00
    " פאר"            1.00
    " ktore"          0.99
    " jste"           0.98
    "considered"      0.97
    Activation Density: 0.006%

    No Known Activations