    Explanations

    mentions of political figures, specifically Narendra Modi and Amit Shah

    Negative Logits
    occo     -0.07
    ipt      -0.07
    nett     -0.07
    erves    -0.07
     Hispan  -0.06
    lectic   -0.06
    onical   -0.06
    óm       -0.06
    .metro   -0.06
    .files   -0.06
    Positive Logits
     mitt    0.07
     OV      0.06
    isti     0.06
     Mell    0.06
    Pa       0.06
    HCI      0.06
    jer      0.06
     Hind    0.06
    #ad      0.06
    angl     0.06
    Activation Density 0.002%

    No Known Activations