INDEX
Explanations
identifiers and numerical data related to metrics and statistics
Negative Logits
Token       Logit
۳۵          -0.17
Kendrick    -0.17
pent        -0.16
825         -0.16
815         -0.15
atat        -0.15
965         -0.15
925         -0.14
275         -0.14
Pent        -0.14
Positive Logits
Token       Logit
490         0.35
190         0.35
130         0.34
460         0.33
390         0.33
180         0.33
430         0.32
230         0.32
240         0.32
440         0.32
Activations Density 0.090%