INDEX
    Explanations

    occurrences of the word "our"

    New Auto-Interp
    Negative Logits
    cue
    -0.15
    anko
    -0.14
     Mond
    -0.14
    ohana
    -0.14
    ngen
    -0.14
    978
    -0.14
    esco
    -0.14
    اضر
    -0.14
     purs
    -0.13
    éħį
    -0.13
    POSITIVE LOGITS
    los
    0.18
    ÌĢ
    0.16
    illis
    0.15
    pec
    0.15
    WithString
    0.14
    à¥Īल
    0.14
    azes
    0.14
    egr
    0.14
    jes
    0.14
    ýš
    0.14
    Act Density 0.042%

    No Known Activations