INDEX
Explanations
proper nouns, specifically names of individuals
New Auto-Interp
Negative Logits
itarian
-0.14
quared
-0.14
/REC
-0.13
inou
-0.13
gro
-0.13
nap
-0.13
opoly
-0.13
anness
-0.13
ophy
-0.13
夫人
-0.13
POSITIVE LOGITS
reporting
0.18
reported
0.18
Reporting
0.16
æĬ¥éģĵ
0.16
Reporting
0.15
Staff
0.15
bureau
0.15
reporter
0.15
Bureau
0.15
Reported
0.15
Activations Density 0.123%