INDEX
    Explanations

    names of political figures

    New Auto-Interp
    Negative Logits
     rake
    -0.61
     Fram
    -0.61
    Ĥª
    -0.61
     adolesc
    -0.59
     FSA
    -0.58
    Detailed
    -0.57
     aggregation
    -0.57
     context
    -0.57
     Roundup
    -0.57
    bryce
    -0.56
    POSITIVE LOGITS
    vu
    0.78
    emort
    0.74
    warm
    0.73
    schild
    0.72
    anamo
    0.71
    ilver
    0.71
    iors
    0.69
    issance
    0.69
    gettable
    0.68
    enstein
    0.68
    Act Density 0.152%

    No Known Activations