Explanations
terms related to global concepts and competition
Negative Logits
Token    Logit
ikel     -0.09
idel     -0.07
cken     -0.07
sse      -0.07
ORY      -0.07
seau     -0.07
ابت      -0.07
esis     -0.07
awai     -0.07
leri     -0.06
Positive Logits
Token      Logit
/global    0.12
/local     0.12
warming    0.12
ToLocal    0.12
/world     0.11
-wide      0.10
isation    0.10
ized       0.10
izing      0.10
-local     0.09
Activations Density: 0.023%