INDEX
    Explanations

    references to colors, primarily focusing on variants of white

    New Auto-Interp
    Negative Logits
    LError
    -0.62
     Leck
    -0.60
     Journeys
    -0.59
    decade
    -0.59
     Lend
    -0.59
    ScopeManager
    -0.58
     ويكيپيديا
    -0.58
    edip
    -0.56
    relationship
    -0.56
     Leap
    -0.55
    POSITIVE LOGITS
    White
    1.15
     Putih
    1.14
     White
    1.14
     white
    1.04
    white
    1.04
     WHITE
    1.02
    WHITE
    0.98
     Whites
    0.87
    Whites
    0.84
     whites
    0.82
    Act Density 0.117%

    No Known Activations