INDEX
    Explanations
    Negative Logits
    Token           Logit
    "-private"      -0.06
                    -0.06
    "enia"          -0.06
    "_lc"           -0.06
    " Cyc"          -0.06
    ".om"           -0.06
    "\tpub"         -0.06
    " bureauc"      -0.06
    "テレビ"        -0.06
    " peaceful"     -0.06
    Positive Logits
    Token           Logit
    " percentage"   0.07
                    0.06
    "astically"     0.06
    "portion"       0.06
    " 최신"         0.06
    "anchor"        0.06
    " onComplete"   0.06
    "itant"         0.06
    " scholarship"  0.06
    " knack"        0.06
    Activation Density: 0.003%

    No Known Activations