INDEX
    Explanations

    references to heroic figures or characters

    references to heroes in various contexts

    New Auto-Interp
    Negative Logits
    aeda
    -0.90
    orie
    -0.85
    ntil
    -0.84
    ateur
    -0.82
    ño
    -0.78
    rupt
    -0.76
    aton
    -0.76
    igree
    -0.73
    imentary
    -0.73
    vant
    -0.72
    POSITIVE LOGITS
     heroes
    0.93
     heroine
    0.83
    ku
    0.77
     Reborn
    0.75
     hero
    0.74
    ically
    0.74
    acters
    0.69
     Saur
    0.65
    rities
    0.65
     Pengu
    0.64
    Act Density 0.012%

    No Known Activations