INDEX
Explanations
numerical references to specific years
New Auto-Interp
Negative Logits
ledged
-0.65
illet
-0.61
ndra
-0.59
edge
-0.59
Edge
-0.58
Tu
-0.56
marqu
-0.55
hungry
-0.54
distingu
-0.54
lightsaber
-0.54
POSITIVE LOGITS
-'
0.95
å¹
0.87
ĸļ
0.84
ãĥŁ
0.77
onwards
0.76
BCE
0.67
onward
0.65
–
0.63
é¾įå¥ij士
0.62
chev
0.62
Activations Density 0.067%