INDEX
    Explanations

    references to historical conflicts and violent actions involving combative forces

    New Auto-Interp
    Negative Logits
    SizeF
    -0.57
    SPATH
    -0.56
    -0.47
     parseFrom
    -0.47
    AutoScale
    -0.47
    тся
    -0.45
    trägen
    -0.44
    benhavn
    -0.44
    LayoutStyle
    -0.44
     avail
    -0.44
    POSITIVE LOGITS
     neceff
    0.72
     pleaſure
    0.67
     poffe
    0.63
     fhew
    0.63
     fevere
    0.61
     مرئيه
    0.61
    ſelf
    0.59
    ghum
    0.59
     polenta
    0.58
    Transkript
    0.58
    Act Density 0.327%

    No Known Activations