Explanations
mentions of professors or those with academic titles
Negative Logits
ographed    -0.17
ennes       -0.16
fred        -0.15
ensis       -0.15
isha        -0.15
iferay      -0.14
agra        -0.14
erra        -0.14
KEEP        -0.14
ern         -0.13
Positive Logits
ession      0.18
Emer        0.17
shima       0.15
ácil        0.14
voke        0.14
¬ģ          0.14
thalm       0.14
ofs         0.14
taire       0.14
cház        0.14
Activations Density 0.018%