INDEX
    Explanations

    references to specific geographical locations, particularly cities and capitals

    New Auto-Interp
    Negative Logits
    atte
    -0.20
    xr
    -0.15
    tura
    -0.14
    edReader
    -0.14
    adera
    -0.14
    orna
    -0.14
    ave
    -0.14
    726
    -0.14
    face
    -0.14
    erner
    -0.14
    POSITIVE LOGITS
    redients
    0.16
    asmus
    0.16
    byn
    0.15
     personals
    0.14
    eration
    0.13
     legis
    0.13
    ãĤīãģĽ
    0.13
     PIO
    0.13
     civil
    0.13
    _mB
    0.13
    Act Density 0.088%

    No Known Activations