INDEX
    Explanations

    phrases related to personal responsibility and risk

    Negative Logits
    iju
    -0.16
     Hoy
    -0.15
    ита
    -0.15
    adelphia
    -0.15
    baugh
    -0.15
    /licenses
    -0.14
    ubat
    -0.13
     Hunter
    -0.13
    addon
    -0.13
    /locale
    -0.13
    Positive Logits
     at
    0.64
    	at
    0.37
     tại
    0.36
     expense
    0.35
    _at
    0.35
    至少
    0.31
    At
    0.31
    expense
    0.29
    -at
    0.29
     Expense
    0.28
    Act Density 0.098%

    No Known Activations