INDEX
    Explanations

    information related to news articles or reports about various political and social subjects

    punctuated segments of text that list items or topics

    New Auto-Interp
    Negative Logits
    quist
    -0.78
    uce
    -0.76
    inx
    -0.76
    iple
    -0.76
    anish
    -0.73
    uble
    -0.72
    earch
    -0.72
    ocry
    -0.71
    ances
    -0.71
    orne
    -0.70
    POSITIVE LOGITS
     namely
    1.41
     albeit
    0.93
     Spectre
    0.87
     respectively
    0.84
     which
    0.79
     viz
    0.78
     aptly
    0.74
     called
    0.74
     Watt
    0.73
     thereby
    0.73
    Act Density 0.317%

    No Known Activations