INDEX
    Explanations

    phrases indicating existence or presence of conditions and situations

    New Auto-Interp
    Negative Logits
    llib
    -0.18
    kov
    -0.16
    InitialState
    -0.15
    shire
    -0.14
    rint
    -0.14
     ä½IJ
    -0.14
     there
    -0.13
    lot
    -0.13
     ch
    -0.13
    _PRESENT
    -0.13
    POSITIVE LOGITS
    quot
    0.16
    OPY
    0.14
    gewater
    0.13
    ialized
    0.13
    hta
    0.13
    Argb
    0.13
    itsu
    0.13
    idian
    0.13
    pecies
    0.13
    ofs
    0.13
    Act Density 0.109%

    No Known Activations