INDEX
Explanations
CSS properties related to background and color styling
New Auto-Interp
Negative Logits
ous
-0.15
centration
-0.14
Norris
-0.14
710
-0.14
Malcolm
-0.13
lush
-0.13
-chevron
-0.13
inem
-0.13
eut
-0.13
Claus
-0.13
POSITIVE LOGITS
gains
0.30
Dod
0.27
Dod
0.23
orch
0.23
bur
0.22
gain
0.22
dod
0.22
Gain
0.21
wheat
0.21
dee
0.21
Activations Density 0.010%