INDEX
Explanations
references to various sectors or industries
New Auto-Interp
Negative Logits
joy
-0.17
vy
-0.16
ertime
-0.16
yan
-0.16
aign
-0.16
è¡ĮæĶ¿
-0.15
sis
-0.15
sy
-0.15
ery
-0.15
itudes
-0.14
POSITIVE LOGITS
ial
0.35
ally
0.25
al
0.24
IAL
0.21
ialized
0.19
ials
0.19
wide
0.18
ially
0.18
åĪ¥
0.18
-wide
0.17
Activations Density 0.016%