INDEX
    Explanations

    variations of the word "individual" in different contexts

    New Auto-Interp
    Negative Logits
    eum
    -0.08
    atte
    -0.08
    kovi
    -0.07
    á»IJ
    -0.07
     poil
    -0.07
     Zem
    -0.07
     predecess
    -0.07
    .attach
    -0.07
    jev
    -0.06
     suce
    -0.06
    POSITIVE LOGITS
    /single
    0.11
     isolated
    0.09
     single
    0.08
    olated
    0.08
    isol
    0.08
     einzel
    0.07
     Single
    0.07
    single
    0.07
     isol
    0.07
    (single
    0.07
    Act Density 0.005%

    No Known Activations