INDEX
    Explanations

    structured data references and identifiers

    New Auto-Interp
    Negative Logits
    lexical
    -0.18
     libertarian
    -0.17
    éľ²
    -0.16
     leak
    -0.15
    literal
    -0.15
     leaks
    -0.15
    λοι
    -0.14
     legis
    -0.14
     libre
    -0.14
    .po
    -0.14
    POSITIVE LOGITS
    (IL
    0.32
     SL
    0.32
     AL
    0.31
     TL
    0.31
     VL
    0.31
     PL
    0.30
     JL
    0.30
     LL
    0.30
     CL
    0.30
     DL
    0.30
    Act Density 0.223%

    No Known Activations