INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    .pay
    -0.07
    WARE
    -0.07
    curities
    -0.07
    .currency
    -0.07
     mythical
    -0.06
     ions
    -0.06
    -0.06
    YYYY
    -0.06
    quality
    -0.06
    _flg
    -0.06
    POSITIVE LOGITS
     Ast
    0.16
     ast
    0.13
    ast
    0.11
    Ast
    0.11
    AST
    0.10
     AST
    0.10
    	ast
    0.09
    .ast
    0.08
    _AST
    0.08
     SST
    0.08
    Act Density 0.003%

    No Known Activations