INDEX
    Explanations

    names of individuals or entities

    proper nouns and specific references to people, places, or organizations

    New Auto-Interp
    Negative Logits
    depending
    -0.45
    IBLE
    -0.44
    natureconservancy
    -0.41
    Cooldown
    -0.40
    ITNESS
    -0.39
    edIn
    -0.39
    FontSize
    -0.39
    laughs
    -0.39
    ©¶æ¥µ
    -0.36
    unless
    -0.36
    POSITIVE LOGITS
     and
    1.38
     &
    1.12
     AND
    1.00
    and
    0.87
     etc
    0.79
     et
    0.77
    &
    0.69
     or
    0.68
    -,
    0.67
    And
    0.64
    Act Density 2.681%

    No Known Activations