INDEX
Explanations
parenthetical expressions or citations
New Auto-Interp
Negative Logits
CreateTagHelper
-0.60
Abit
-0.49
side
-0.47
Hayley
-0.46
AnimationsModule
-0.46
Portale
-0.45
side
-0.43
dated
-0.42
adv
-0.41
adv
-0.41
POSITIVE LOGITS
CD
1.49
SD
1.40
FD
1.39
PD
1.39
CD
1.35
BD
1.35
GD
1.33
TD
1.28
cd
1.27
GD
1.25
Activations Density 0.600%