INDEX
Explanations
mathematical notation and symbols used in equations and functions
New Auto-Interp
Negative Logits
McDowell
-0.67
Chan
-0.62
Bela
-0.60
Slate
-0.58
Titus
-0.56
Rani
-0.56
Anastasia
-0.56
Lyn
-0.55
Slate
-0.54
Anastasia
-0.53
POSITIVE LOGITS
Lombard
0.71
laura
0.69
Woodward
0.67
Walther
0.65
Ventura
0.65
McCain
0.64
Galbraith
0.64
ald
0.64
Laura
0.64
Duckworth
0.64
Activations Density 0.586%