INDEX
    Explanations

    heading definitions and entities

    New Auto-Interp
    Negative Logits
     dönt
    0.47
     gamble
    0.45
     volition
    0.44
     congratulate
    0.44
     zerstört
    0.44
    Cread
    0.42
    oluminescence
    0.42
     devout
    0.42
     ended
    0.41
    sehen
    0.41
    POSITIVE LOGITS
     경로
    0.51
    0.51
    \}\
    0.49
    circledR
    0.49
    니다
    0.48
    0.48
     Realms
    0.47
    𝐸
    0.47
    वाहक
    0.47
    цыі
    0.47
    Act Density 0.000%

    No Known Activations