INDEX
    Explanations

    general news or reporting about various topics and events

    questions regarding comparisons and contrasts in various contexts

    New Auto-Interp
    Negative Logits
    ©¶æ¥µ
    -0.64
    APD
    -0.62
    ullivan
    -0.59
    inery
    -0.58
    agraph
    -0.57
    jong
    -0.57
    ctuary
    -0.56
    mast
    -0.54
    iors
    -0.54
    DERR
    -0.53
    POSITIVE LOGITS
    ?
    2.42
    )?
    2.42
    ?"
    2.20
    ?:
    2.18
    ?",
    2.14
    "?
    2.14
    '?
    2.12
    ?),
    2.12
    ?).
    2.11
    ?!
    2.10
    Act Density 1.189%

    No Known Activations