INDEX
    Explanations

    names or variables that are abbreviated using the format initial<tab>capitalized

    New Auto-Interp
    Negative Logits
    <bos>
    -0.76
    .
    -0.60
    !
    -0.59
     and
    -0.59
     a
    -0.58
     or
    -0.58
     of
    -0.58
     (
    -0.58
     s
    -0.57
     to
    -0.57
    POSITIVE LOGITS
     alkoh
    1.75
     kram
    1.67
     makro
    1.61
     silikon
    1.58
     keramik
    1.56
     uhr
    1.55
     antik
    1.53
     maksi
    1.53
     kac
    1.53
     kompati
    1.52
    Act Density 0.124%

    No Known Activations