INDEX
    Explanations

    quantitative data related to measurements and numerical values

    New Auto-Interp
    Negative Logits
    .
    -0.46
    (
    -0.35
     points
    -0.34
     nomb
    -0.33
     arkas
    -0.33
     it
    -0.33
    ลง
    -0.32
     structure
    -0.32
     I
    -0.31
     Do
    -0.31
    POSITIVE LOGITS
    featureID
    0.82
     $_"
    0.81
     Administrativna
    0.79
     queſta
    0.76
    iſchen
    0.72
    LEncoder
    0.71
    ImageContext
    0.70
     beſch
    0.68
    ロウィン
    0.67
     Wikimedijinoj
    0.67
    Act Density 0.207%

    No Known Activations