INDEX
Explanations
words related to scientific concepts and technical terms, particularly in physics and biology
terms related to physical or scientific properties and classifications
New Auto-Interp
Negative Logits
arrogance
-0.71
mock
-0.71
complaining
-0.70
Therapy
-0.69
couch
-0.69
envy
-0.68
approve
-0.68
stew
-0.67
wont
-0.65
frustration
-0.64
POSITIVE LOGITS
itudinal
1.05
onal
1.05
otropic
1.05
ogeneous
1.00
olar
1.00
omorphic
1.00
ucle
0.99
atern
0.98
llular
0.97
onic
0.97
Activations Density 0.170%