INDEX
Explanations
countries
mentions of the country "Namibia."
New Auto-Interp
Negative Logits
thumb
-0.66
inhib
-0.65
grounding
-0.64
textbooks
-0.63
posit
-0.63
yeast
-0.59
hypertension
-0.58
outgoing
-0.58
subparagraph
-0.56
gradient
-0.56
POSITIVE LOGITS
eless
1.19
ibia
1.16
azing
0.99
nam
0.98
urai
0.96
aste
0.96
aran
0.92
borgh
0.92
ask
0.91
eli
0.90
Activations Density 0.023%