INDEX
    Explanations

    phrases or words in a specific foreign language

    occurrences of a specific character or symbol

    New Auto-Interp
    Negative Logits
    ORED
    -0.85
     Sussex
    -0.76
    IFIED
    -0.73
     guiActiveUnfocused
    -0.68
     Jericho
    -0.65
     Mayweather
    -0.62
     actors
    -0.61
     Bullets
    -0.60
    URES
    -0.58
     bearer
    -0.58
    POSITIVE LOGITS
    ä
    1.23
    inen
    1.16
    ¢
    1.10
    1.00
    ·
    0.99
    ternity
    0.98
    ki
    0.94
    hl
    0.93
    tten
    0.90
    î
    0.90
    Act Density 0.013%

    No Known Activations