INDEX
Explanations
mentions or related words about different types of tools
references to tools used in various contexts
New Auto-Interp
Negative Logits
erest
-0.73
Sons
-0.71
ometown
-0.70
ouse
-0.70
Shal
-0.67
noon
-0.66
oning
-0.66
athan
-0.64
uates
-0.64
orus
-0.64
POSITIVE LOGITS
tools
1.36
tips
1.16
kit
1.12
tool
1.03
tools
1.02
Tools
1.02
guiActiveUn
0.98
Tools
0.98
levers
0.95
*/(
0.80
Activations Density 0.014%