    Explanations

    questions starting with "How"

    Negative Logits
    Token             Logit
    "'):\n"           -0.88
    "`;\n"            -0.76
    "₁)"              -0.76
    '"):\n'           -0.75
    "tanleria"        -0.74
    "GEBURTSDATUM"    -0.74
    "_\n"             -0.73
    "ftagPool"        -0.72
    "':\n"            -0.72
    '".\n'            -0.72

    Positive Logits
    Token        Logit
    " How"       2.19
    "How"        2.19
    "HOW"        1.37
    " HOW"       1.35
    "how"        1.32
    "Why"        1.21
    " Why"       1.16
    "What"       1.16
    "Cómo"       1.13
    "Hvordan"    1.12

    Act Density 0.090%

    No Known Activations