INDEX
    Explanations

    mentions of the word "Sultan."

    Negative Logits
    Token             Logit
    " estekak"        -0.40
    " noastre"        -0.39
    " Goed"           -0.38
    "gemaakt"         -0.36
    " travaillons"    -0.33
    "break"           -0.31
    " デイ"           -0.31
    "kyard"           -0.31
    "Past"            -0.30
    " اح"             -0.30
    Positive Logits
    Token             Logit
    " Sultan"         2.52
    "Sultan"          2.31
    " sultan"         2.05
    " Sult"           1.40
    " sult"           1.23
    "ultan"           1.09
    " سلط"            1.02
    " السلط"          0.86
    " Sullivan"       0.81
    " SUL"            0.75
    Activation Density: 0.001%

    No Known Activations