INDEX
Explanations
terms related to scientific concepts and theories
Negative Logits
HQ        -0.16
ζÏĮ       -0.16
azel      -0.16
gap       -0.16
->__      -0.15
orst      -0.14
orris     -0.14
itar      -0.14
473       -0.14
Awake     -0.14
Positive Logits
rel       0.45
Rel       0.32
relativ   0.27
-rel      0.25
Einstein  0.24
Rel       0.24
rel       0.24
ÙĨسب      0.23
Newton    0.23
Ein       0.23
Activations Density 0.109%