    Explanations

    mentions of "Deep State."

    Negative Logits
     فريبيس       -0.87
     purpoſe      -0.85
     greateſt     -0.84
     myſelf       -0.83
     ſame         -0.83
     $_"          -0.81
     occaf        -0.81
     houſe        -0.80
     itſelf       -0.79
     neceffary    -0.79
    Positive Logits
    deep          1.30
    Deep          1.29
     Deep         1.27
     deep         1.09
     profound     1.05
    DEEP          0.95
     DEEP         0.92
                  0.91
     deeply       0.86
     deepest      0.84
    Act Density 0.190%

    No Known Activations