INDEX
Explanations
words related to biological measurements and assessments
New Auto-Interp
Negative Logits
ivate
-0.19
àµį
-0.19
iment
-0.16
pitch
-0.16
.scalablytyped
-0.16
ikut
-0.16
Mell
-0.15
ician
-0.15
pri
-0.15
ivism
-0.15
POSITIVE LOGITS
orphic
0.27
orph
0.27
etric
0.27
ycin
0.25
orphism
0.23
agnet
0.23
etry
0.21
agnetic
0.21
olecular
0.21
inded
0.20
Activations Density 0.100%