INDEX
    Explanations

    phrases related to specific objects or concepts, such as "oxygen supplies" and "electricity grid"

    specific nouns and topics related to health, environment, and legal issues

    New Auto-Interp
    Negative Logits
    ulhu
    -0.69
    ggles
    -0.66
    agall
    -0.61
    âĺĨ
    -0.61
    ifice
    -0.60
    infeld
    -0.58
    umblr
    -0.58
    edy
    -0.55
    OHN
    -0.55
    vez
    -0.54
    POSITIVE LOGITS
    pox
    0.65
    vale
    0.57
     discrimination
    0.56
    worth
    0.54
    ãĥ
    0.53
    iculture
    0.53
     tattoo
    0.53
     billboards
    0.52
    bestos
    0.51
    shaw
    0.51
    Act Density 1.083%

    No Known Activations