INDEX
    Explanations

    references to locations and geographical entities

    New Auto-Interp
    Negative Logits
     كومونز
    -0.57
    httphttps
    -0.52
    Rhestr
    -0.51
     يتيمه
    -0.50
    afficheront
    -0.47
    tagHelperRunner
    -0.47
     ब्रेकडाउन
    -0.45
     للاسماء
    -0.45
    IsContent
    -0.44
     بيها
    -0.44
    POSITIVE LOGITS
    <eos>
    1.00
     ✭✭
    0.45
     endregion
    0.43
    depart
    0.42
    orde
    0.42
     enfans
    0.41
     concludes
    0.41
    https
    0.41
     Kars
    0.40
     Италијани
    0.40
    Act Density 0.302%

    No Known Activations