INDEX
Explanations
terms related to the immune system
New Auto-Interp
Negative Logits
abwe
-0.79
ymm
-0.73
Speed
-0.69
gencies
-0.66
imate
-0.64
peed
-0.62
gore
-0.61
own
-0.60
Calais
-0.60
asketball
-0.59
POSITIVE LOGITS
�士
0.77
Ambrose
0.76
aceous
0.74
�
0.73
nuts
0.72
OLD
0.72
Creed
0.71
ォ
0.70
osa
0.70
stars
0.67
Activations Density 0.025%