INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    _attrs
    -0.08
    _inner
    -0.08
     Turkey
    -0.08
     nonprofits
    -0.08
     philanthropic
    -0.08
    _dyn
    -0.08
     counseling
    -0.07
    Funcs
    -0.07
     Dover
    -0.07
    Attrs
    -0.07
    POSITIVE LOGITS
     query
    0.12
    	query
    0.12
    query
    0.11
    查询
    0.11
     查询
    0.11
    _QUERY
    0.11
     Syntax
    0.10
    $query
    0.10
     querying
    0.10
     Queries
    0.10
    Act Density 0.005%

    No Known Activations