INDEX
    Explanations

    the word "language" or a set phrase including it

    New Auto-Interp
    Negative Logits
    клопе
    -1.04
     myſelf
    -1.02
     auffi
    -1.00
     whoſe
    -0.99
     Efq
    -0.99
     leſs
    -0.96
    InjectAttribute
    -0.95
     Chwiliwch
    -0.95
     propOrder
    -0.95
    windowFixed
    -0.94
    POSITIVE LOGITS
    0.52
    <strong>
    0.50
     $
    0.50
     F
    0.50
     E
    0.46
     W
    0.45
     G
    0.42
     http
    0.42
     C
    0.42
     structures
    0.42
    Act Density 0.006%

    No Known Activations