INDEX
Explanations
concepts related to systems thinking and systemic change
New Auto-Interp
Negative Logits
ure
-0.20
ion
-0.17
iger
-0.16
åΏ
-0.16
ansson
-0.15
ugi
-0.15
pine
-0.15
ại
-0.15
лег
-0.14
immer
-0.14
POSITIVE LOGITS
atics
0.30
atic
0.23
atically
0.21
ically
0.21
-wide
0.21
atische
0.20
wide
0.20
atik
0.20
UnderTest
0.20
عاÙħÙĦ
0.19
Activations Density 0.077%