INDEX
    Explanations

    references to the color "red" in various contexts

    New Auto-Interp
    Negative Logits
    LookAnd
    -0.51
     MaterialApp
    -0.44
    principalColumn
    -0.41
     colgante
    -0.40
     AssemblyCompany
    -0.39
    点此举报
    -0.39
     urodz
    -0.39
     inalámbrico
    -0.39
    Carriera
    -0.38
     costado
    -0.37
    POSITIVE LOGITS
    acted
    0.60
    rawn
    0.58
    ACTED
    0.57
    dish
    0.55
    ouble
    0.54
    ditor
    0.54
    oub
    0.53
     flags
    0.53
     hot
    0.52
     carpet
    0.52
    Act Density 0.205%

    No Known Activations