INDEX
    Explanations

    instances of the phrase "Il" or similar structures in the text

    New Auto-Interp
    Negative Logits
    wich
    -0.19
    warz
    -0.16
     Abbott
    -0.16
    Abb
    -0.15
    abb
    -0.15
    ardon
    -0.15
    θι
    -0.14
    ä¹±
    -0.14
    иÑĪ
    -0.14
    ilha
    -0.14
    POSITIVE LOGITS
    á»ijt
    0.15
     STRICT
    0.15
    ObjectType
    0.15
    rane
    0.14
    NSNotification
    0.14
    swer
    0.14
     Waist
    0.14
     Tyto
    0.14
     DÃŃky
    0.14
    æĪ¸
    0.13
    Act Density 0.004%

    No Known Activations