INDEX
Explanations
references to confidence and self-assurance
Negative Logits
ozem              -0.17
ardon             -0.17
ward              -0.15
ewise             -0.14
WaitForSeconds    -0.14
//{{              -0.14
.Middle           -0.14
.Msg              -0.14
gren              -0.14
WARD              -0.14
Positive Logits
/conf             0.19
Vak               0.17
uchs              0.16
otto              0.16
nest              0.15
confidence        0.15
vale              0.15
icy               0.15
y                 0.15
forth             0.15
Activations Density 0.012%