INDEX
Explanations
concepts related to systemic change and collaboration
New Auto-Interp
Negative Logits
akov
-0.18
uebas
-0.14
tre
-0.13
ÅĻes
-0.13
bles
-0.13
assy
-0.13
_SIGNAL
-0.13
å°ĺ
-0.12
egas
-0.12
udos
-0.12
POSITIVE LOGITS
change
0.68
change
0.61
Change
0.57
-change
0.57
Change
0.54
CHANGE
0.52
_change
0.49
CHANGE
0.48
.change
0.47
(change
0.46
Activations Density 0.345%