INDEX
    Explanations

    enumerations or categorizations in the text

    New Auto-Interp
    Negative Logits
    iyon
    -0.15
    linger
    -0.14
    etc
    -0.14
     Canter
    -0.14
    efa
    -0.14
    kus
    -0.14
    UNUSED
    -0.14
     etc
    -0.14
    trait
    -0.13
    	Copyright
    -0.13
    POSITIVE LOGITS
    :
    0.25
    :↵
    0.23
     depending
    0.22
    :↵↵
    0.21
    ãĢĤä¸Ģ
    0.19
    .First
    0.18
    depending
    0.18
    :č↵
    0.16
    :</
    0.16
     ():
    0.16
    Act Density 0.093%

    No Known Activations