INDEX
    Explanations

    phrases indicating personal reflections or disclosures

    New Auto-Interp
    Negative Logits
    __(/*!
    -0.67
    ftagPool
    -0.60
     Administrativna
    -0.59
    ebabkan
    -0.56
    rouvez
    -0.56
    ]=>
    -0.55
    $_['
    -0.55
     Italijanski
    -0.54
     eorum
    -0.54
    ταν
    -0.54
    POSITIVE LOGITS
    余談
    1.07
     quick
    1.01
     FYI
    0.99
     note
    0.92
    FYI
    0.92
    quick
    0.87
    Quick
    0.86
     siden
    0.85
    顺便
    0.83
     briefly
    0.82
    Act Density 0.341%

    No Known Activations