INDEX
Explanations
metrics and statistics related to performance and scoring
New Auto-Interp
Negative Logits
anson
-0.14
obar
-0.14
anton
-0.14
owitz
-0.14
ben
-0.14
identity
-0.14
bar
-0.13
aca
-0.13
Listings
-0.13
ilik
-0.13
POSITIVE LOGITS
λα
0.17
era
0.17
eral
0.16
ubl
0.15
óst
0.15
_nbr
0.15
htable
0.14
_ub
0.14
orna
0.14
sid
0.14
Activations Density 0.047%