INDEX
    Explanations

    the word "but" and its variations to signal contrast or contradiction

    New Auto-Interp
    Negative Logits
    Personensuche
    -0.84
     amphi
    -0.74
     Saxons
    -0.73
    Cyfeiriadau
    -0.71
     تضيفلها
    -0.70
     gynhyrchwyd
    -0.70
     noDo
    -0.68
    vece
    -0.68
    nocześnie
    -0.68
     encima
    -0.68
    POSITIVE LOGITS
    qxd
    0.55
    eqn
    0.52
     μ
    0.52
     UnityEngine
    0.52
    hets
    0.50
    Seine
    0.50
    paramref
    0.50
    %
    
    0.50
    Koordinaten
    0.50
    }")]
    0.49
    Act Density 0.025%

    No Known Activations