INDEX
    Explanations

    phrases expressing a degree of judgment or opinion about various subjects

    New Auto-Interp
    Negative Logits
     Dee
    -0.15
    бина
    -0.14
    .kode
    -0.14
    rab
    -0.14
    emic
    -0.14
    akov
    -0.14
    ift
    -0.14
    fit
    -0.14
    ctr
    -0.13
     authority
    -0.13
    POSITIVE LOGITS
     compens
    0.15
    ÅĻet
    0.15
    گاÙĩÛĮ
    0.14
    .glide
    0.14
    IDGET
    0.14
    ома
    0.14
    æ»
    0.14
    .Invariant
    0.14
    tty
    0.14
    iges
    0.13
    Act Density 0.100%

    No Known Activations