INDEX
Explanations
the conditional word "if"
New Auto-Interp
Negative Logits
Eye
-0.78
Catalog
-0.69
ocker
-0.67
incial
-0.66
cience
-0.65
emetery
-0.64
¥ŀ
-0.64
aven
-0.63
gate
-0.62
igned
-0.62
POSITIVE LOGITS
anyone
1.02
fy
0.96
there
0.95
anybody
0.94
they
0.90
anything
0.89
you
0.80
any
0.78
it
0.76
rame
0.74
Activations Density 0.050%