INDEX
Explanations
the word "Nam" followed by a single character for a proper noun or abbreviation
references to a specific entity or individual, particularly "Nam."
New Auto-Interp
Negative Logits
posit
-0.66
grounding
-0.59
inference
-0.59
inhib
-0.59
almonds
-0.59
understatement
-0.58
yeast
-0.58
DW
-0.58
inhibition
-0.57
HEAD
-0.57
POSITIVE LOGITS
ibia
1.15
eless
1.05
azing
1.00
pered
0.93
aji
0.93
pty
0.92
essage
0.90
ankind
0.89
achu
0.89
our
0.86
Activations Density 0.029%