INDEX
    Explanations

    phrases and structures indicating relationships and dynamics among characters or entities

    New Auto-Interp
    Negative Logits
    argas
    -0.16
    iez
    -0.15
    aina
    -0.15
    eck
    -0.15
     Gambling
    -0.15
    assadors
    -0.15
    ief
    -0.14
    beck
    -0.14
    izontal
    -0.14
    被
    -0.14
    POSITIVE LOGITS
     going
    0.42
    going
    0.35
     Going
    0.29
     gonna
    0.28
    -g
    0.27
    -going
    0.27
    Going
    0.27
     gon
    0.26
     gun
    0.25
     g
    0.24
    Act Density 0.098%

    No Known Activations