INDEX
    NEGATIVE LOGITS
    Logit   Token
    -0.08   " cuộc"
    -0.07   "_major"
    -0.06   "_disc"
    -0.06   "\ttile"
    -0.06   " visita"
    -0.06   (token missing)
    -0.06   " -------------------------------------------------------------------------↵"
    -0.06   " yeri"
    -0.06   ".Auto"
    -0.06   ".gc"
    POSITIVE LOGITS
    Logit   Token
     0.10   " makes"
     0.08   "makes"
     0.07   " Hath"
     0.07   "-making"
     0.07   " fortress"
     0.07   "Provides"
     0.06   " make"
     0.06   " thinks"
     0.06   " personals"
     0.06   " Hurt"
    Activation Density: 0.033%

    No Known Activations