INDEX
    Explanations

    mentions of different individuals within a particular context

    New Auto-Interp
    Negative Logits
    <bos>
    -1.09
     magnify
    -0.56
     thoughtless
    -0.55
     or
    -0.55
     and
    -0.53
    тол
    -0.51
     to
    -0.51
     bestow
    -0.51
     endeavouring
    -0.51
     unwarran
    -0.51
    POSITIVE LOGITS
     alkoh
    1.45
     kosme
    1.27
     kompati
    1.27
     antik
    1.27
     silikon
    1.27
     kram
    1.25
     optik
    1.23
     akut
    1.22
     praktik
    1.21
     logis
    1.20
    Act Density 0.106%

    No Known Activations