INDEX
Explanations
names of people in news articles or reports
proper nouns and names associated with individuals or entities
Negative Logits
aged     -0.84
tons     -0.81
dit      -0.79
noon     -0.76
ahime    -0.76
ragon    -0.75
many     -0.73
Totem    -0.73
berry    -0.70
age      -0.69
Positive Logits
fn           1.08
fn           0.81
FN           0.77
pload        0.75
tremend      0.73
umpy         0.73
umbnails     0.72
ilee         0.70
ļéĨĴ         0.69
guiActiveUn  0.68
Activations Density 0.024%