INDEX
    Explanations

    technical terminology and concepts related to scientific and academic subjects

    New Auto-Interp
    Negative Logits
    reur
    -0.14
    adesh
    -0.14
    à¥įण
    -0.13
    inson
    -0.13
     поÑĢ
    -0.13
    (HWND
    -0.12
    bruar
    -0.12
    ije
    -0.12
     tripod
    -0.12
    Ú©Ø´
    -0.12
    POSITIVE LOGITS
     like
    0.71
     Like
    0.58
     unlike
    0.51
    Like
    0.50
     LIKE
    0.46
     seperti
    0.44
    like
    0.44
     giá»ijng
    0.43
    åĥı
    0.42
     wie
    0.41
    Act Density 0.356%

    No Known Activations