INDEX
    Explanations

    references to financial concerns and decisions

    New Auto-Interp
    Negative Logits
    kus
    -0.15
    dera
    -0.15
    ifen
    -0.14
    _('
    -0.14
    zelf
    -0.13
    nth
    -0.13
    ActionCreators
    -0.13
    venes
    -0.13
    olland
    -0.13
    eward
    -0.13
    POSITIVE LOGITS
     these
    0.27
     this
    0.22
    these
    0.20
    è¿ĻäºĽ
    0.19
     them
    0.19
     These
    0.18
     it
    0.17
     said
    0.16
     they
    0.16
     she
    0.16
    Act Density 9.420%

    No Known Activations