INDEX
    Explanations

    money amounts and names

    New Auto-Interp
    Negative Logits
    ...
    -0.10
    Âł
    -0.10
    P
    -0.09
    A
    -0.08
    :
    -0.08
     ...
    -0.08
    B
    -0.08
    C
    -0.08
    E
    -0.08
    â̦
    -0.08
    POSITIVE LOGITS
    ¶Į
    0.15
    ¦æĥħ
    0.13
    ÂĢÂĢ
    0.12
    <|begin_of_text|>
    0.12
    EMPLARY
    0.11
    ³ç´°
    0.11
     -*-č\n
    0.11
    TRGL
    0.11
    DCALL
    0.10
    įng
    0.10
    Act Density 0.091%

    No Known Activations