INDEX
Explanations
discussion points about game mechanics and their implications
New Auto-Interp
Negative Logits
ucch
-0.16
!↵↵↵↵↵↵
-0.14
wcs
-0.14
orWhere
-0.14
pany
-0.14
evi
-0.14
âĻª↵↵
-0.13
!“↵↵
-0.13
ï¼ģï¼ģ↵↵
-0.13
*=*=
-0.12
POSITIVE LOGITS
IM
0.72
IMO
0.72
IMO
0.65
im
0.60
IM
0.56
imo
0.51
(IM
0.49
ime
0.43
IME
0.40
_IM
0.40
Activations Density 0.505%