INDEX
    Explanations

    references to temporal contexts

    New Auto-Interp
    Negative Logits
    RTLI
    -0.53
    adaptiveStyles
    -0.47
    انيف
    -0.46
     ujednoznacz
    -0.46
    skrift
    -0.45
    ſchen
    -0.45
    ſcher
    -0.42
     propOrder
    -0.42
    Hentet
    -0.40
     Infórmanos
    -0.39
    POSITIVE LOGITS
     Gleichzeitig
    1.10
     samtidigt
    0.97
     gleichzeitig
    0.96
     samtidig
    0.93
     Simultaneously
    0.92
     zugleich
    0.91
     simultaneously
    0.86
     jednocześnie
    0.86
    nocześnie
    0.84
    同時に
    0.83
    Act Density 0.009%

    No Known Activations