INDEX
    Explanations

    questions that express curiosity or uncertainty

    Negative Logits
    Token               Logit
     FontWeight         -0.63
    ="@+                -0.60
     INTERESAR          -0.60
    addContainerGap     -0.59
    bootstrapcdn        -0.57
     ProtoMessage       -0.55
    DoubleQuotes        -0.55
     <>",               -0.55
    Personensuche       -0.51
    UnsafeEnabled       -0.51
    Positive Logits
    Token               Logit
    "])↵                0.78
    ScopeManager        0.67
    "]);↵               0.67
    >");↵               0.64
     متعلقه             0.63
    !")↵                0.61
    }")↵                0.61
    '])↵                0.60
    ɚ                   0.60
    )");↵               0.59
    Act Density 0.287%

    No Known Activations