INDEX
Explanations
references to heirs or lineage
New Auto-Interp
Negative Logits
eer
-0.07
oto
-0.07
tainment
-0.07
OTO
-0.07
anton
-0.06
haut
-0.06
-0.06
urator
-0.06
lx
-0.06
ylvania
-0.06
POSITIVE LOGITS
loom
0.11
ship
0.10
apparent
0.08
locks
0.07
duk
0.07
ighth
0.07
atatype
0.07
unsafe
0.07
imate
0.07
iffs
0.07
Activations Density 0.002%