INDEX
    Explanations

    phrases related to important or controversial topics

    references to subjects or topics of discussion or debate

    New Auto-Interp
    Negative Logits
     Towers
    -0.80
    asters
    -0.73
    iaries
    -0.72
    anova
    -0.71
    orters
    -0.69
    poons
    -0.69
    ãĥ³ãĤ¸
    -0.67
    dp
    -0.66
    borg
    -0.63
    endars
    -0.63
    POSITIVE LOGITS
     itself
    0.78
     urgently
    0.72
    atics
    0.71
    href
    0.71
     posed
    0.68
     squarely
    0.67
     stemmed
    0.65
    forth
    0.65
    fulness
    0.64
     vigorously
    0.63
    Act Density 0.157%

    No Known Activations