INDEX
Explanations
technical terms or introductory phrases
instances of the word "introducing" and related variations
New Auto-Interp
Negative Logits
avorite
-0.71
Immunity
-0.68
Mara
-0.68
Fenrir
-0.64
sleeper
-0.62
Graves
-0.60
Pend
-0.60
raint
-0.60
Mankind
-0.60
Pixie
-0.59
POSITIVE LOGITS
ctory
1.59
ctions
1.39
cing
1.32
ce
1.18
cé
1.14
pta
1.14
gment
1.09
ctive
1.08
cer
1.05
ction
1.05
Activations Density 0.045%