INDEX
    Explanations

    proper nouns related to companies or organizations

    mentions of "Big" followed by numerical values, particularly those referring to groups or organizations

    New Auto-Interp
    Negative Logits
     confir
    -0.94
    idency
    -0.87
    anwhile
    -0.86
    yrim
    -0.82
    Downloadha
    -0.78
    theless
    -0.77
    veyard
    -0.73
     guiActiveUn
    -0.73
    istry
    -0.72
    izabeth
    -0.71
    POSITIVE LOGITS
    gest
    1.29
    ger
    1.14
    glers
    0.87
    ging
    0.86
    gers
    0.85
    gins
    0.84
     Brother
    0.83
    gie
    0.83
    Integer
    0.81
     Daddy
    0.78
    Act Density 0.017%

    No Known Activations