INDEX
    Explanations

    references to specific events or performances

    New Auto-Interp
    Negative Logits
    <bos>
    -2.80
    <?
    -1.12
    -1.00
    
    
    -0.99
    /**
    -0.88
     intersper
    -0.83
    /*
    -0.78
     quitted
    -0.75
     rehabilitate
    -0.74
     banish
    -0.73
    POSITIVE LOGITS
     karton
    1.49
     silikon
    1.42
     kafe
    1.40
     keramik
    1.38
     alkoh
    1.29
     seksi
    1.27
     kosme
    1.26
     uhr
    1.25
     optik
    1.16
     krim
    1.16
    Act Density 0.068%

    No Known Activations