INDEX
Explanations
references and mentions related to hacking
New Auto-Interp
Negative Logits
itives
-0.86
aution
-0.82
sembly
-0.81
orem
-0.80
uctor
-0.79
itionally
-0.79
itors
-0.78
raints
-0.76
fman
-0.75
orship
-0.74
POSITIVE LOGITS
jee
0.94
butt
0.94
extraord
0.89
geist
0.84
Berry
0.82
idge
0.78
TY
0.77
lite
0.73
intosh
0.72
clips
0.71
Activations Density 0.016%