    Explanations

    names or references related to Islamic culture and figures

    NEGATIVE LOGITS
    ensch           -0.19
    azzi            -0.19
    @js             -0.18
    Ŀi              -0.16
    .scalablytyped  -0.16
    PFN             -0.16
    λογία           -0.15
    jong            -0.15
    REFERRED        -0.15
    oden            -0.15
    POSITIVE LOGITS
    гÑĥ             0.16
    (s              0.14
    de              0.14
    buz             0.14
     Voc            0.14
    .liferay        0.14
    et              0.14
     battle         0.13
     telling        0.13
     cat            0.13
    Activation Density: 0.073%
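
As a rough illustration of what a density figure like the one above usually means, the sketch below computes the percentage of tokens on which a feature fires. It assumes the common definition (share of tokens with activation above a threshold of zero); the function name `activation_density` and the sample data are hypothetical and are not taken from this dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of tokens whose activation exceeds `threshold`.

    Assumption: density is defined as (active tokens / total tokens) * 100.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical data: 100,000 tokens, 73 of which activate the feature.
sample = [0.0] * 99_927 + [0.5] * 73
print(f"{activation_density(sample):.3f}%")
```

The token counts here were chosen only so that the result lands near the reported value; the real sample size behind the dashboard figure is not stated.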

    No Known Activations