INDEX
Explanations
scientific measurements and terms related to physical properties and phenomena
New Auto-Interp
Negative Logits
å£
-0.15
etto
-0.15
insp
-0.14
862
-0.14
raj
-0.14
äºĭ
-0.14
SimpleName
-0.14
اطÙĦ
-0.14
ARB
-0.14
itol
-0.13
POSITIVE LOGITS
iw
0.17
olo
0.16
hlen
0.15
Dah
0.14
Dahl
0.14
Thor
0.14
cip
0.14
iant
0.13
ili
0.13
Strength
0.13
Activations Density 0.190%