INDEX
    Explanations

    terms relating to user actions and capabilities

    New Auto-Interp
    Negative Logits
     kasarigan
    -0.69
    -0.63
    expandindo
    -0.60
    RTLR
    -0.56
    uxxxx
    -0.55
     surla
    -0.52
    SPJ
    -0.51
    GEBURTSDATUM
    -0.49
     nakalista
    -0.47
     initComponents
    -0.47
    POSITIVE LOGITS
     themselves
    0.85
     their
    0.71
    themselves
    0.63
    their
    0.57
     Their
    0.57
     ihre
    0.56
    Their
    0.56
     själva
    0.55
     ihren
    0.52
     ihr
    0.50
    Act Density 0.609%

    No Known Activations