INDEX
Explanations
phrases related to explaining processes or mechanisms
New Auto-Interp
Negative Logits
оба
-0.16
اذ
-0.16
جب
-0.15
ÑĢÑĥк
-0.15
ahlen
-0.15
клад
-0.15
unnel
-0.15
ëĵľ
-0.14
.mousePosition
-0.14
gregate
-0.14
POSITIVE LOGITS
principle
0.17
principals
0.17
principles
0.15
Principle
0.15
aira
0.15
sko
0.14
principio
0.14
behind
0.14
Principal
0.14
олоÑģ
0.14
Activations Density 0.129%