INDEX
Explanations
possessive pronouns and references to divinity
New Auto-Interp
Negative Logits
Hollow
-0.16
IMIT
-0.14
leur
-0.14
ecast
-0.14
lectric
-0.13
Fac
-0.13
hattan
-0.13
brids
-0.13
odiac
-0.13
hollow
-0.13
POSITIVE LOGITS
ify
0.15
chy
0.15
atel
0.14
_fps
0.14
afd
0.14
apon
0.14
jer
0.14
alom
0.14
erek
0.14
eni
0.13
Activations Density 0.038%