INDEX
    Explanations

    references to brushes and brain-related terms

    New Auto-Interp
    Negative Logits
     GLS
    -0.84
    ArgsConstructor
    -0.83
     Lakeside
    -0.78
     Tides
    -0.77
     Akufo
    -0.77
     Hoyt
    -0.73
     Udaipur
    -0.72
    ذت
    -0.72
     WaitForSeconds
    -0.72
     Napole
    -0.71
    POSITIVE LOGITS
     BR
    1.04
     Br
    0.99
     brush
    0.95
     br
    0.94
     brushes
    0.93
     Bri
    0.92
     Bra
    0.91
     Brind
    0.89
     Brush
    0.89
    Br
    0.88
    Act Density 0.073%

    No Known Activations