INDEX
Explanations
directory names in file paths
references to directory structures and file organization
New Auto-Interp
Negative Logits
conduct
-0.78
acted
-0.74
acting
-0.73
uca
-0.72
pter
-0.71
Ò
-0.71
akening
-0.70
oria
-0.68
iating
-0.67
aughs
-0.67
POSITIVE LOGITS
directory
1.02
~/.
1.00
'/
0.97
folder
0.97
"/
0.96
(/
0.93
ystem
0.92
folders
0.91
=/
0.91
hierarchy
0.90
Activations Density 0.041%