INDEX
    Explanations

    references to superhero characters and their associated narratives

    New Auto-Interp
    Negative Logits
    reed
    -0.17
    oby
    -0.15
    anca
    -0.14
     Highlander
    -0.14
    aeda
    -0.14
    zee
    -0.14
    asser
    -0.14
    omer
    -0.14
    ebin
    -0.14
    evin
    -0.14
    POSITIVE LOGITS
     اÙĦعربÙĬ
    0.15
    Flip
    0.14
    525
    0.14
     Spinner
    0.14
     MetroFramework
    0.14
    815
    0.14
    pekt
    0.14
     Dare
    0.14
     Flip
    0.13
    contra
    0.13
    Act Density 0.039%

    No Known Activations