INDEX
    Explanations

    words related to ranks or titles

    punctuation and stylized text formats

    New Auto-Interp
    Negative Logits
    IDENT
    -0.64
    aminer
    -0.63
    Poll
    -0.63
    IRC
    -0.62
     Bomber
    -0.62
    Alert
    -0.60
    velt
    -0.59
     Platform
    -0.58
    LECT
    -0.58
    TPP
    -0.57
    POSITIVE LOGITS
    a
    0.87
    o
    0.72
    ahs
    0.67
    acia
    0.67
    ta
    0.65
    Ãł
    0.65
    és
    0.64
    ataka
    0.64
    nen
    0.61
    dds
    0.60
    Act Density 0.051%

    No Known Activations