INDEX
    Explanations

    statements or claims made by individuals

    New Auto-Interp
    Negative Logits
     perfected
    -0.15
    ifu
    -0.15
    cus
    -0.14
    beg
    -0.14
     Fiscal
    -0.14
    tog
    -0.14
    ìļ±
    -0.14
    æĺŃåĴĮ
    -0.13
    atron
    -0.13
    ella
    -0.13
    POSITIVE LOGITS
    lice
    0.15
    stroy
    0.15
    ļĮ
    0.15
    bove
    0.15
    uste
    0.14
    XB
    0.14
     dap
    0.14
    538
    0.14
    resa
    0.14
    sea
    0.14
    Act Density 0.764%

    No Known Activations