Explanations
references to the concept of "nature" in various contexts
Negative Logits
Vect      -0.15
398       -0.14
rect      -0.14
abet      -0.14
ap        -0.14
495       -0.14
Vig       -0.14
ersion    -0.13
ông       -0.13
ork       -0.13
Positive Logits
onden     0.16
ruk       0.15
iet       0.15
.URI      0.15
вÑĸ       0.15
claimer   0.15
historic  0.15
inkle     0.15
olle      0.15
کرÛĮ      0.14
Activations Density 0.037%