INDEX
    Explanations

    references to specific individuals, particularly those with the name "Kat" or variations thereof

    New Auto-Interp
    Negative Logits
    ollo
    -0.15
     Lac
    -0.14
    iciary
    -0.14
    eca
    -0.14
     sac
    -0.14
    ICI
    -0.14
    539
    -0.14
    ắt
    -0.14
    ["_
    -0.14
    847
    -0.13
    POSITIVE LOGITS
    owitz
    0.19
    rink
    0.17
    ziej
    0.17
     vice
    0.15
    Flash
    0.14
    ainen
    0.14
    ä»ĺ
    0.14
    妮
    0.14
    apolis
    0.14
    ار
    0.14
    Act Density 0.032%

    No Known Activations