INDEX
Explanations
important locations and focal points within various contexts
New Auto-Interp
Negative Logits
olle
-0.16
новид
-0.15
iez
-0.15
cdc
-0.14
تÙĪØ³
-0.13
inen
-0.13
uren
-0.13
ings
-0.13
Sawyer
-0.13
.FLAG
-0.13
POSITIVE LOGITS
attention
0.24
gravity
0.24
activity
0.22
everything
0.22
gravity
0.21
Attention
0.20
hub
0.19
point
0.19
Gravity
0.19
universe
0.19
Activations Density 0.062%