INDEX
Explanations
people's ages
references to the ages of individuals
New Auto-Interp
Negative Logits
ascript
-1.00
mbuds
-0.99
Ô
-0.97
hovah
-0.93
ecause
-0.88
illance
-0.86
irtual
-0.84
ravings
-0.83
qqa
-0.83
thia
-0.82
POSITIVE LOGITS
Frenchman
1.10
lawmaker
0.98
duo
0.97
politician
0.97
singer
0.95
businessman
0.92
artist
0.91
senator
0.90
musician
0.89
rapper
0.89
Activations Density 0.078%