INDEX
    Explanations

    phrases related to authorizing something or giving permission

    special characters or symbols in the text

    New Auto-Interp
    Negative Logits
     fishes
    -0.75
     welf
    -0.71
     dock
    -0.70
    ãĥ¼ãĥĨãĤ£
    -0.66
     ignition
    -0.66
     greens
    -0.66
     Odin
    -0.65
     plaque
    -0.64
     antip
    -0.63
     charger
    -0.63
    POSITIVE LOGITS
    WHERE
    0.97
    wait
    0.93
    they
    0.91
    yet
    0.90
    there
    0.88
    well
    0.88
    our
    0.87
    where
    0.86
    cases
    0.85
    you
    0.84
    Act Density 0.036%

    No Known Activations