INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    iastical
    -0.68
    AnchorStyles
    -0.66
    LookAnd
    -0.59
     Италијани
    -0.57
     propOrder
    -0.56
    estacks
    -0.56
     thiệu
    -0.55
    GOTREF
    -0.54
     تضيفلها
    -0.54
    InputBorder
    -0.52
    POSITIVE LOGITS
    ?
    2.34
    %?
    1.48
    ?”
    1.45
    ?!
    1.41
    ?"
    1.41
    ?</
    1.38
    ?
    
    1.38
    ؟
    1.38
    1.37
    ?)
    1.36
    Act Density 0.132%

    No Known Activations