INDEX
Explanations
file-related terms or actions
references to files
New Auto-Interp
Negative Logits
rosso
-0.91
¥µ
-0.82
asy
-0.70
ghai
-0.69
idols
-0.68
agonists
-0.67
hops
-0.67
obia
-0.64
soph
-0.63
Arist
-0.63
POSITIVE LOGITS
file
3.65
files
2.66
file
2.65
File
2.61
File
2.46
FILE
2.25
Files
1.97
FILE
1.84
files
1.79
filename
1.74
Activations Density 0.010%