INDEX
Explanations
references to specific individuals and their achievements
New Auto-Interp
Negative Logits
acket
-0.16
acias
-0.15
uling
-0.15
enco
-0.15
Dash
-0.15
ertil
-0.14
unos
-0.14
dust
-0.14
-Sah
-0.14
mlin
-0.14
POSITIVE LOGITS
oct
0.24
concede
0.24
conced
0.23
conf
0.21
oct
0.21
attrib
0.20
confer
0.20
off
0.20
donate
0.20
don
0.19
Activations Density 0.057%