INDEX
Explanations
references to various foundations and their activities
Negative Logits
oline       -0.18
reff        -0.18
union       -0.16
ingen       -0.16
Foundation  -0.16
275         -0.15
sik         -0.15
isions      -0.15
baz         -0.15
asso        -0.15
POSITIVE LOGITS
ally         0.20
ality        0.19
aries        0.18
lation       0.17
lay          0.17
ary          0.17
arity        0.17
/Foundation  0.17
aire         0.17
hei          0.16
Activations Density: 0.021%