INDEX
    Explanations

    references to stadiums and arenas

    New Auto-Interp
    Negative Logits
    <bos>
    -1.48
     intersper
    -0.81
     AssemblyCompany
    -0.72
    /**
    -0.70
     coar
    -0.66
     quitted
    -0.65
     trod
    -0.64
    -0.62
     darted
    -0.62
     curate
    -0.62
    POSITIVE LOGITS
     Stadium
    1.19
     stadium
    1.14
    Stadium
    1.14
    stadium
    1.08
     arena
    1.02
     Arena
    1.01
     stadiums
    0.95
    Arena
    0.92
    arena
    0.90
    liseum
    0.72
    Act Density 0.461%

    No Known Activations