INDEX
    Explanations

    numeric values and dates

    New Auto-Interp
    Negative Logits
    ↵↵
    -0.16
    æĹıèĩªæ²»
    -0.15
    oth
    -0.15
    avanaugh
    -0.15
    434
    -0.14
     reve
    -0.14
    ÌĤ
    -0.14
     Trap
    -0.14
    нÑĮ
    -0.14
     net
    -0.14
    POSITIVE LOGITS
    _simps
    0.16
    mess
    0.16
    imet
    0.15
    ahy
    0.15
    ungle
    0.15
    addir
    0.15
    coh
    0.14
    mes
    0.14
    trag
    0.14
    licit
    0.14
    Act Density 0.003%

    No Known Activations