INDEX
    Explanations

    references to authors and academic citations

    New Auto-Interp
    Negative Logits
    apesh
    -0.14
    γο
    -0.14
    Äįan
    -0.14
    apur
    -0.13
    arine
    -0.13
    orgia
    -0.13
    lamak
    -0.13
     âľĵ
    -0.13
    ìļ°
    -0.13
     forensic
    -0.13
    POSITIVE LOGITS
    196
    0.21
    197
    0.20
    198
    0.19
     Laugh
    0.18
    195
    0.16
     independently
    0.16
    ulis
    0.16
     USSR
    0.16
     Lands
    0.15
    affle
    0.15
    Act Density 0.161%

    No Known Activations