INDEX
    Explanations

    hyphenated words and phrases

    phrases indicating negation or absence

    New Auto-Interp
    Negative Logits
     Ake
    -0.71
     Seeking
    -0.69
     Pok
    -0.68
     fade
    -0.68
     harshly
    -0.68
     moder
    -0.68
     Tik
    -0.65
     Levi
    -0.65
     geared
    -0.65
     cultiv
    -0.65
    POSITIVE LOGITS
    same
    1.27
    middle
    1.21
    mom
    1.20
    money
    1.16
    grain
    1.14
    scenes
    1.14
    prem
    1.14
    distance
    1.14
    surface
    1.12
    ground
    1.11
    Act Density 0.020%

    No Known Activations