INDEX
Explanations
names of individuals
proper nouns, specifically names of individuals and organizations
New Auto-Interp
Negative Logits
ccording
-0.92
confir
-0.75
acknow
-0.69
irony
-0.65
flared
-0.64
acron
-0.64
infographic
-0.63
referen
-0.62
PHOTO
-0.61
ģĸ
-0.60
POSITIVE LOGITS
anski
0.83
insky
0.82
iani
0.80
zon
0.76
usky
0.76
len
0.76
oliath
0.76
inski
0.75
existent
0.75
ovich
0.74
Activations Density 0.275%