INDEX
Explanations
words related to medical research and immune system properties
New Auto-Interp
Negative Logits
bye
-0.55
loe
-0.54
nen
-0.53
Square
-0.53
cyl
-0.51
Hover
-0.50
Puzzles
-0.49
umbnail
-0.49
tsky
-0.48
uously
-0.48
POSITIVE LOGITS
enhagen
0.66
ochemistry
0.58
PsyNetMessage
0.57
forcement
0.53
omnia
0.53
transpl
0.52
UNE
0.52
ersive
0.52
iferation
0.51
iries
0.50
Activations Density 16.009%