INDEX
Explanations
proper nouns with the last syllable "ner"
mentions of specific individuals or entities
New Auto-Interp
Negative Logits
DRAG
-0.74
Haram
-0.74
CPI
-0.66
risen
-0.64
aliases
-0.62
sea
-0.60
Skies
-0.59
Zh
-0.59
Silk
-0.59
Peaks
-0.59
POSITIVE LOGITS
ner
4.44
ners
2.85
NER
2.76
nery
1.89
ning
1.81
nings
1.54
ener
1.42
nered
1.37
nar
1.34
ned
1.23
Activations Density 0.021%