INDEX
Explanations
terms related to the immune system and its functioning
New Auto-Interp
Negative Logits
pta
-0.18
دÙĨ
-0.18
instein
-0.16
füg
-0.15
jes
-0.15
kv
-0.15
ÙĨ
-0.15
ÙĨب
-0.14
innen
-0.14
ern
-0.14
POSITIVE LOGITS
ognito
0.17
476
0.15
395
0.15
unst
0.14
MMdd
0.14
Ã¥de
0.14
lien
0.14
orado
0.14
rive
0.14
against
0.14
Activations Density 0.020%