INDEX
Explanations
Bloody followed by Marys or Spear
New Auto-Interp
Negative Logits
urface
-0.82
sacar
-0.78
yogurt
-0.77
ACADEM
-0.76
äischen
-0.75
evrops
-0.73
Lio
-0.73
ilion
-0.72
KERNEL
-0.72
šu
-0.72
POSITIVE LOGITS
Bloody
2.25
Bloody
1.80
bloody
1.74
bloody
1.32
tomato
1.10
hair
1.08
Hair
1.05
celery
1.00
Hair
0.98
Tomato
0.98
Activations Density 0.011%