INDEX
    Explanations

    words related to political figures or entities

    mentions of "legislation" or related terms

    New Auto-Interp
    Negative Logits
    âķIJâķIJ
    -0.91
     Kindle
    -0.71
     à¨
    -0.66
    hower
    -0.65
    aukee
    -0.63
     Learns
    -0.63
    ãģ¦
    -0.62
    ciation
    -0.61
     AIR
    -0.61
    ï¸
    -0.61
    POSITIVE LOGITS
    itimate
    1.33
    isl
    1.28
     Leg
    1.10
    uin
    1.05
    acies
    1.04
    Leg
    1.04
    leg
    0.93
    acy
    0.92
    lore
    0.88
    busters
    0.84
    Act Density 0.011%

    No Known Activations