INDEX
    Explanations

    words related to laws or regulations

    instances of specific letters or characters in the text

    New Auto-Interp
    Negative Logits
     Watkins
    -0.72
     KP
    -0.70
    Ĥİ
    -0.69
     Kau
    -0.67
     KS
    -0.66
     Kern
    -0.66
     ASA
    -0.65
     Leilan
    -0.65
     Leopard
    -0.64
     Tactics
    -0.64
    POSITIVE LOGITS
    vernment
    1.02
    ancial
    0.89
    ̶
    0.85
    undred
    0.84
    É
    0.83
    roc
    0.80
    actly
    0.80
    oyal
    0.80
    ploy
    0.80
    iable
    0.79
    Act Density 0.172%

    No Known Activations