INDEX
    Explanations

    instances of the word "flee" or its variations, indicating escape or flight scenarios

    New Auto-Interp
    Negative Logits
    eum
    -0.07
    frei
    -0.07
     poil
    -0.07
    oe
    -0.06
     (;;)
    -0.06
    .UnitTesting
    -0.06
    ofday
    -0.06
    outs
    -0.06
    izations
    -0.06
    xing
    -0.06
    POSITIVE LOGITS
     khá»ıi
    0.10
       
    0.07
    zik
    0.07
    omen
    0.06
    ت
    0.06
    entlich
    0.06
    504
    0.06
    pond
    0.06
    .references
    0.06
    kul
    0.06
    Act Density 0.005%

    No Known Activations