INDEX
    Explanations

    specific examples

    instances where "for example" is used to introduce illustrative cases or explanations

    New Auto-Interp
    Negative Logits
    ressed
    -0.68
    sil
    -0.66
    emate
    -0.66
    ormal
    -0.65
     parliamentary
    -0.65
    ements
    -0.65
    ELY
    -0.64
    rive
    -0.64
    vell
    -0.62
    ogun
    -0.62
    POSITIVE LOGITS
     Takeru
    0.68
     Jenkins
    0.67
     Schn
    0.65
    =#
    0.64
     owing
    0.64
    iHUD
    0.63
    lihood
    0.63
    ãĥīãĥ©
    0.63
    Æ
    0.63
    ©¶æ¥µ
    0.63
    Act Density 0.020%

    No Known Activations