INDEX
    Explanations

    items that indicate an explanation or list format

    New Auto-Interp
    Negative Logits
     propOrder
    -1.23
    Personensuche
    -1.07
     nahilalakip
    -0.95
    Portály
    -0.86
     Italijanski
    -0.85
    featureID
    -0.84
    -------------</
    -0.82
     ویکی‌پدیا
    -0.82
    SharedDtor
    -0.81
    таратура
    -0.81
    POSITIVE LOGITS
    .
    0.77
    <eos>
    0.69
    (
    0.68
    0.64
    *
    0.60
    ↵↵
    0.58
    "
    0.56
    0.55
    0.54
    ,
    0.54
    Act Density 0.303%

    No Known Activations