INDEX
    Explanations

    general information or details from news articles

    occurrences of the word "More."

    Negative Logits
    Token        Logit
    "keeping"    -0.75
    " 2024"      -0.70
    "liest"      -0.68
    "Fram"       -0.67
    "atan"       -0.67
    "RL"         -0.65
    "Enlarge"    -0.65
    "itudes"     -0.65
    "RN"         -0.64
    "same"       -0.62

    Positive Logits
    Token               Logit
    " ado"              1.16
    " Than"             1.10
    " than"             1.04
    " importantly"      1.03
    " mature"           0.79
    "than"              0.76
    " important"        0.75
    " sophisticated"    0.75
    " extensive"        0.74
    " stringent"        0.74

    Activation Density: 0.035%

    No Known Activations