INDEX
    Explanations

    providing or receiving information

    New Auto-Interp
    Negative Logits
     '
    -2.84
     "
    -2.78
    '
    -2.77
    </b>
    -2.55
     "(
    -2.34
     "'
    -2.22
     Teilen
    -2.16
     somber
    -2.16
     ezek
    -2.14
    i
    -2.14
    POSITIVE LOGITS
    .
    3.98
    2.91
    2.80
    2.67
    .")
    
    2.61
     itſelf
    2.58
    2.52
     muft
    2.50
    2.42
    2.42
    Act Density 0.006%

    No Known Activations