INDEX
    Explanations

    references to the Star Wars franchise and its related media

    New Auto-Interp
    Negative Logits
    hir
    -0.15
    Scene
    -0.14
     Tits
    -0.14
    avo
    -0.14
    nek
    -0.14
     Scene
    -0.14
    odium
    -0.14
    fov
    -0.13
    odi
    -0.13
    isay
    -0.13
    POSITIVE LOGITS
     universe
    0.34
     canon
    0.33
     continuity
    0.32
     univers
    0.29
     Universe
    0.28
     Canon
    0.28
     franchise
    0.28
     lore
    0.27
     cannon
    0.27
    Canon
    0.26
    Act Density 0.125%

    No Known Activations