INDEX
    Explanations

    occurrences of the word "knowledge" and its variations

    New Auto-Interp
    Negative Logits
    isha
    -0.16
    acci
    -0.15
    atters
    -0.15
    pekt
    -0.13
    ALIGN
    -0.13
    \<^
    -0.13
    acades
    -0.13
    ork
    -0.13
    487
    -0.13
    anno
    -0.13
    POSITIVE LOGITS
    .microsoft
    0.19
    fully
    0.17
    senal
    0.15
    ifar
    0.14
    ://%
    0.14
    crate
    0.14
    lenÃŃ
    0.14
    #af
    0.14
    owski
    0.14
    θο
    0.14
    Act Density 0.023%

    No Known Activations