INDEX
    Explanations

    quantitative survey results and statistical findings

    New Auto-Interp
    Negative Logits
    featureID
    -0.55
    !*\
    -0.52
    "":
    -0.52
    -0.50
    HandlerContext
    -0.50
    -0.50
    `"
    -0.50
    hone
    -0.49
    UCI
    -0.49
    VersionUID
    -0.49
    POSITIVE LOGITS
     infatti
    0.45
     addirittura
    0.41
     misalnya
    0.41
     namorado
    0.41
     sogar
    0.40
    rungsseite
    0.39
     jopa
    0.39
     nawet
    0.39
     notamment
    0.38
     zelfs
    0.38
    Act Density 0.148%

    No Known Activations