    Explanations

    phrases related to various topics such as education, politics, crime, and environmental issues

    phrases and terms related to recent events and processes

    Negative Logits
    "/-"            -0.59
    "hani"          -0.55
    "nces"          -0.52
    "ATURES"        -0.49
    "utenberg"      -0.48
    "pixel"         -0.48
    "aily"          -0.48
    "kos"           -0.47
    " Caption"      -0.46
    "-)"            -0.45
    Positive Logits
    " guise"        0.59
    " vicinity"     0.51
    " backdrop"     0.50
    "esville"       0.49
    "widget"        0.49
    " turbulent"    0.47
    " workplace"    0.46
    " unregulated"  0.46
    " foreseeable"  0.45
    " unfamiliar"   0.44
    Activation Density: 1.427%

    No Known Activations