INDEX
Explanations
button class names in HTML code
Negative Logits
Puck        -0.75
Puck        -0.71
tation      -0.70
(blank)     -0.70
eX          -0.68
Tung        -0.68
Crow        -0.67
tung        -0.66
That        -0.64
Bade        -0.64
Positive Logits
btn              1.76
btn              1.76
Btn              1.03
Btn              1.01
Yeats            0.93
Btns             0.89
GenerationType   0.84
btns             0.82
ControllerBase   0.79
ರು               0.79
Activations Density 0.043%