Explanations
scientific terminology related to physics and physical processes
Negative Logits
USTOM       -0.17
iren        -0.15
icut        -0.15
ulur        -0.15
idor        -0.14
moci        -0.14
handc       -0.14
éº          -0.14
predecess   -0.13
_sur        -0.13
Positive Logits
ÏĢή         0.16
opi         0.15
489         0.15
ollen       0.14
ذ           0.14
ilities     0.14
ocos        0.14
ÑıÑħ        0.13
олÑĮно      0.13
hu          0.13
Activations Density 0.117%