INDEX
Explanations
mentions of a specific entity or person named "Ner" with different numeric identifiers
proper nouns, particularly names of individuals
Negative Logits
Token          Logit
avorite        -0.71
carry          -0.68
UTC            -0.66
Constructed    -0.65
reaching       -0.64
EMP            -0.63
Breaking       -0.63
ERROR          -0.62
runaway        -0.60
oral           -0.60
POSITIVE LOGITS
Token          Logit
getic          1.02
ding           0.98
gie            0.96
ners           0.94
ning           0.90
lund           0.88
ner            0.86
nen            0.86
opol           0.86
amic           0.84
Activations Density 0.018%