INDEX
Explanations
references to numbers and their significance in various contexts
New Auto-Interp
Negative Logits
ing
-0.16
ве
-0.15
rop
-0.15
war
-0.15
ings
-0.15
ITTE
-0.14
ViewHolder
-0.14
seed
-0.14
idelberg
-0.14
er
-0.14
POSITIVE LOGITS
UpDown
0.20
eral
0.20
osity
0.18
ismatic
0.18
ical
0.17
ICAL
0.17
érique
0.17
Bris
0.17
rical
0.15
ERO
0.15
Activations Density 0.015%