INDEX
    Explanations

    mentions of specific locations or organizations

    New Auto-Interp
    Negative Logits
     regress
    -0.64
     dividend
    -0.63
    schild
    -0.61
    MENT
    -0.60
    WARD
    -0.58
     revelation
    -0.58
     tert
    -0.57
     respects
    -0.57
     Scythe
    -0.56
     neut
    -0.56
    POSITIVE LOGITS
    oga
    0.92
    oland
    0.82
    uan
    0.77
    oen
    0.76
    ulhu
    0.76
    otta
    0.74
    atti
    0.73
    ega
    0.72
    ractor
    0.72
    amac
    0.71
    Act Density 0.100%

    No Known Activations